R 



e 



PORT 



resumes 



ED 016 517 

DEVELOPMENT OF A TECHNIQUE FOR IDENTIFYINS 
CHILDREN’S MUSICAL CONCEPTS. FINAL REPORT. 
BY- ANDREWS I FRANCES M. DEIHLi NED C. 
PENNSYLVANIA STATE UNIV.i UNIVERSITY PARK 
REPORT NUMBER BR-5-0233 
CONTRACT OEC-6-10-002 

EDRS PRICE MF-$0.75 HC-$7.S0 193F. 



PS 000 330 
ELEMENTARY SCHOOL 



PUB DATE SEP 67 



DESCRIPTORS- #MUSIC EDUCATIONi MUSIC TECHNIQUESi ^ELEMENTARY 
SCHOOL STUDENTSi CONCEPT FORMATIONi ^MEASUREMENT INSTRUMENTS- 
STATISTICAL ANALYSISi STIMULUS DEVICESi CURRICULUM 
DEVELOPMENT! ^FUNDAMENTAL CONCEPTS i «MUSICi COGNITIVE 
MEASUREMENT! BATTERY OF MUSICAL CONCEPT MEASURES! 
PENNSYLVANIA 



CURRENT LITERATURE RELEVANT TO MUSIC EDUCATION OF 
CHILDREN STRESSES THE IMPORTANCE OF CURRICULUM DEVELOPMENT 
BASED ON A CONCEPTUAL APPROACH TO MUSIC LEARNING. THERE BEING 
LITTLE UNDERSTANDING OF CHILDREN’S CONCEPTS OF BASIC MUSICAL 
ELEMENTS ON WHICH TO FOUND SUCH CURRICULUM DEVELOPMENT! AN 
INSTRUMENT WAS DEVELOPED TO IDENTIFY THE IDEAS REGARDING 
PITCH! DURATION! AND LOUDNESS WHICH ARE POSSESSED BY 
CHILDREN. TWO GROUP MEASURES ELICITING WRITTEN RESPONSES TO 
(1) WRITTEN STIMULI AND C2) MULTIDIMENSIONAL MUSICAL STIMULI 
WERE DEVISED. IN ADDITION! 2 INDIVIDUAL MEASURES REQUIRING 
(1) THE MANIPULATION OF ELEMENTARY SOUND PRODUCING 
INSTRUMENTS AND (2) OVERT MOVEMENT IN RESPONSE TO MUSICAL 
STIMULI WERE FORMULATED. AFTER PILOT TESTING AND DEVELOPMENT! 
THIS INSTRUMENT WAS EMPLOYED IN A STUDY OF FOURTH-GRADE 
CHILDREN* THE GROUP MEASURES WERE ADMINISTERED TO 429 
SUBJECTS AND THE INDIVIDUAL MEASURES TO 214 SUBJECTS. DERIVED 
SCORES; STATISTICAL EVALUATIONS! AND CORRELATIONS WITH IQ AND 
READING SCORES (REPORTED IN 61 TABLES) DEMONSTRATE THAT! 
ALTHOUGH FURTHER REFINEMENT IS NEEDED BEFORE IT WILL BE 
USEABLE FOR PRACTICAL PURPOSES! THE INSTRUMENT IS ADEQUATE 
FOR RESEARCH PURPOSES. THE DEVELOPMENT OF THIS MEASURE HAS 
SUGGESTED SEVERAL AREAS IN WHICH THE NEED FOR FURTHER 
RESEARCH IS INDICATED. (JS) 



O 



ERIC 






P/1 




iLpk 




|0 FINAL REPORT 

*\ 4- TvTrx Q_n 



^*^E r - ejGGt - No . 5-0233 
Contract No. OE-6-10-002 



UJ 



DEVELOPMENT OF A TECHNIQUE FOR IDENTIFYING ELEMENTARY 
SCHOOL CHILDREN'S MUSICAL CONCEPTS 



CO 

o 

o 

o 

CO 



ERIC 



September 1967 



U.S. DEPARTMENT OF 
HEALTH, EDUCATION, AND WELFARE 



O 



Office of Education 
Bureau of Research 



PS 000330 



U.S. DEPARTMENT OF HEALTH, EDUCATION & WELFARE 
OFFICE OF EDUCATION 



THIS DOCUMENT HAS BEEN REPRODUCED EXACTLY AS RECEIVED FROM THE 
PERSON OR ORGANIZATION ORIGINATING IT. POINTS OF VIEW OR OPINIONS 
STATED DO NOT NECESSARILY REPRESENT OFFICIAL OFFICE OF EDUCATION 
POSITION OR POLICY. 

DEVELOPMENT OF A TECHNIQUE FOR IDENTIFYING ELEMENTARY 
SCHOOL CHILDREN’S MUSICAL CONCEPTS 



Project No. 5-0233 
Contract No. OE-6-10-002 



Frances M. Andrews 
Ned C. Delhi 



September 1967 



The research reported herein was performed pursuant to a 
contract with the Office of Education, U.S. Department of 
Health, Education, and Welfare. Contractors undertaking 
such projects under Government sponsorship are encouraged 
to express freely their professional judgment in the con- 
duct of the project. Points of view or opinions stated do 
not, therefore, necessarily represent official Office of 
Education position or policy. 



The Pennsylvania State University 



University Park, Pennsylvania 



Acknowledgments 



The research project reported herein represents the cooperative 
efforts of many individuals who have made constructive contributions of their 
time and knowledge. Sincere appreciation is due the following; The United 
States Office of Education for supporting the research project; Abraham W. 
VanderMeer, Dean, College of Education, The Pennsylvania State University, 
for unfailing encouragement and expression of interest; Harold E. Mitzel, 
Assistant Dean for Research, College of Education, for expert advice on re- 
search design; Robert L. Lathrop, Chairman of the Graduate Faculty, College 
of Education, for assistance as a test and measurements specialist; and the 
many school music supervisors, directors, and administrators who cooperated 
in the field testing phases of the research. 

Three graduate assistants, Grace E. Laverty, A. Peter Costanza, 
and Cathy J. Cobes, made a major and outstanding contribution to the project; 
in addition, John Peightel, graduate assistant, was helpful in his brief as- 
sociation with the project. 

The competent services of Christy Morgan are gratefully acknowl- 
edged, particularly in recognition of her general help as project secretary 
and as typist of the final report. 



TABLE OF CONTENTS 



Acknowledgments 

List of Tables 

I. Overview of the Study 

Introduction 

Purpose 

Definitions 

Limitations ■ 

II. Review of Related Literature 

Background 

Elements of Music 

Concepts 

Related Studies 

III. Development of Battery of Musical Concept Measures 

Verbal Measure 

University Area Trial • 

Estella Trial 

Central Dauphin Trial 

Pilot Study 

Listening Measure 

Exploratory Trial 

University Groups Trials 

University Area Trial 

Estella Trial 

Central Dauphin Trial 

Pilot Study 

Validation Trial 

Manipulative Measure 

University Area Trial 

Estella Trial 

Bellefonte Area Trial 

Pilot Study 

Matternville Trial 

Validation Trials 

Overt Measure 

University Area Trial 

Estella Trial 

Bellefonte Trial 

Boalsburg Trial 

Pilot Study 

Validation Trial 

Written and Nonwritten Measures 



PAGE 

ii 



1 

1 

2 

2 

4 

5 
5 
5 
7 
9 

13 

13 

13 

14 

15 

16 
17 

19 

20 
21 
22 
22 
23 
26 
27 

27 

28 
28 
29 

31 

32 

32 

33 

34 

34 

35 

36 

38 

39 

45 

45 

45 



IV. Main Study 
Purpose 
Subjects 



iii 



PAGE 



Materials and Procedures 

Results 

Reliability 

Item Analysis 

Correlations and Partial Correlations . 

Validity 

Secondary Analysis of the Overt Measure 

Stanines 

Discussion 

Reliability of Measures 

Item Analysis 

Correlations and Partial Correlations . 

Validity 

Secondary Analysis of Overt Measure , . 



45 

48 

48 

51 

65 

69 

70 
72 
72 
72 

78 

79 

80 
82 



V» Summary j Conclusions and Implications, and Recommendations for 

Future Research 

Summary ! * * ' 

The Problem and Its Background 

Description of Battery of Musical Concept Measures . . , 

Procedures 

Subjects 

Conclusions and Implications 

Recommendations for Future Research ! ! ! * 



83 

83 

83 

84 

85 

86 
86 
87 



References 



89 



Appendices 

A. Statistical Tables from Prepilot Studies ...*!!.*.*!! 

B. Measures Used Prior to Main Study [ ] 

C. Measures Used in Main Study 

D. Information Forms Sent to Schools in Main Study Samples ! ! 

E. Computer Programs Used in Statistical Analyses 



93 

93 

105 

144 

166 

172 



ERIC Document Resume 



185 



LIST OF TABLES 



TABLE page 

I Means, Standard Deviations, and Reliability Estimates 

for the Verbal Measure, Pilot Study 17 

II Verbal Measure Item Analysis , Pilot Study 18 

III Means, Standard Deviations, and Reliability Estimates 

for the Listening Measure, Pilot Study 24 

IV Listening Measure Item Analysis, Pilot Study 25 

V Percentage of Correct Answers to Listening Items, 

High School Students , , 26 

VI Means, Standard Deviations, and Reliability Estimates 

for the Manipulative Measure, Pilot Study 29 

VII Manipulative Measure Item Analysis, Pilot Study 30 

VIII Means, Standard Deviations, and Reliability Estimates 

for the Manipulative Measure, Matternville Trial 32 

IX Means, Standard Deviations, and Reliability Estimates 

for the Overt Measure, Pilot Study 36 

X Overt Measure Item Analysis, Pilot Study 37 

XI Results of Validating Trial, Overt Measure , , 38 

XII Design and Development of the Battery of Musical 

Concept Measures 41 

XIII Means and Standard Deviations of Derived Scores, Written, 

Nonwritten, and Separate Measures, Pilot Study 42 

XIV Strata Fixed Reliability, Strata Reliability, Ratio of 
Subjects by Strata Mean Square to Residual Mean 
Square, and Correlations Between Strata for the 



Written and Nonwritten Derived Scores, Pilot Study .... 43 

XV Symmetric Correlation Matrix, Pilot Study 44 

XVI Pennsylvania School Districts Included in Sample, Main 

Study 46 

XVII Means and Standard Deviations for the Four Measures, 

Main Study 48 

XVIII Means and Standard Deviations for the Derived Scores 

from the Four Measures, Main Study 49 



O 




V 



TABLE 



PAGE 



XIX Means and Standard Deviations of the Written and 

Nonwritten Derived Scores, Main Study ' 49 

XX Strata Fixed Reliability, Strata Reliability, Ratio 
of Subjects by Strata Mean Square to Residual 
Mean Square, and Correlations Among Strata for 
the Four Measures, Main Study , 50 

XXI Strata Fixed Reliability, Strata Reliability, Ratio 
of Subjects by Strata Mean Square to Residual 
Mean Square, and Correlations Between Strata 
for the Written and Nonwritten Derived Scores, 

Main Study 52 

XXII Verbal Measure Item Analysis , Main Study 53 

XXIII Verbal Measure Item Analysis, Main Study 54 

XXIV Listening Measure Item Analysis, Main Study 55 

XXV Listening Measure Item Analysis, Main Study ........ 56 

XXVI Manipulative Measure Item Analysis, Main Study ...... 57 

XXVII Overt Measure Item Analysis, Main Study 58 

XXVIII Written Pitch Item Analysis, Main Study 59 

XXIX Written Duration Item Analysis, Main Study 60 

XXX Written Loudness Item Analysis, Main Study 61 

XXXI Nonwritten Pitch Item Analysis, Main Study 62 

XXXII Nonwritten Duration Item Analysis, Main Study 63 

XXXIII Nonwritten Loudness Item Analysis, Main Study 64 

XXXIV Summary Correlation Matrix for the Four Measures, IQ, 

and Reading for Various Selected Samples, Main Study . . 66 

XXXV Summary Correlation Matrix for the Written and Nonwritten 
Measures, IQ, ;and Reading for Various Selected Samples, 

Main Study gy 

XXXVI Written and Nonwritten Measures, Correlations and Partial 

Correlations Holding out IQ and Reading, Main Study . . 68 

XXXVII Means and Standard Deviations for Reading and IQ Scores, 

Selected Samples, Main Study .............. 69 

XXXVIII Correlation Matrix for the Written and Nonwritten 

Measures (Multitrait-Multiraethod Matrix), Main Study . , 70 

Vi 



TABLE 



PAGE 



\ 

XXXIX Oral Responses Given to Overt Measure Items, Main Study . . • 71 

XL Means, Standard Deviations, and Kuder-Richardson 
Estimates of Agreement for the Overt and Oral 

Responses on the Overt Measure, Main Study 71 

XLI Stanine Conversion Table, Written Fitch 73 

XLII Stanine Conversion Table, Nonwritten Fitch 73 

XLIII Stanine Conversion Table, Written Duration 74 

XLIV Stanine Conversion Table, Nonwritten Duration 74 

XLV Stanine Conversion Table, Written Loudness 75 

XLVI Stanine Conversion Table, Nonwritten Loudness 75 

XLVII Stanine Conversion Table, Verbal Measure 76 

XLVIII Stanine Conversion Table, Listening Measure . . . 76 

XLIX Stanine Conversion Table, Manipulative Measure 77 

L Stanine Conversion Table, Overt Measure 77 

LI Verbal Measure Item Analysis, Estella Trial . 94 

LII Verbal Measure Item Analysis, Central Dauphin Trial 95 

LIII Verbal Measure Item Analysis, Central Dauphin and 

Estella Combined 96 

LIV Listening Items, Percentages Correct, University Groups 

Trials 97 

LV Listening Measure Item Analysis, University Area Trial ... 98 

LVI Listening Measure Item Analysis, Estella Trial 99 

» 

LVII Means and Standard Deviations for the Listening Measure, 

Prepilot Trials 100 

LVIII Listening Measure Item Analysis, Central Dauphin and 

Estella Combined 101 

LIX Manipulative Measure Item Analysis, Estella Trial 102 

LX Manipulative Measure Item Analysis, Belief onte Trial .... 103 

LXI Overt Measure Item Analysis, Estella Trial 104 




vii 



CHAPTER I 



OVERVIEW OF THE STUDY 



Introduction 



Major authorities In the field of music education, In speeches, 
articles, and textbooks published In the past several years, have devoted 
considerable attention to the structurally based, concept-centered music 
curriculum. This reflects a general trend In the search for a more efficient 
and effective method of communicating knowledge, and also Indicates a spe- 
cific Interest In the reexamination of music as a subject to be taught and 
learned under the aegis of the schools. Current literature reveals a search 
for a core of musical content that can be accepted as common to the many 
areas of music education (e.g.. Instrumental, choral, general music, appre- 
ciation, and theory classes) and Is also appropriate as a common basis for 
the musical learning of all segments of the school population. 

Music has been variously described as both an art and a discipline 
with proponents of both camps making strong cases for their views. It la 
both, and derives considerable strength In Its curricular status because of 
this duality. However, a complicating aspect of the duality Is an uncertainty 
regarding the nature and status of the art and discipline. Extensive research 
Into the characteristics of these two facets Is needed to provide a sound 
framework for the music curriculum. 

The present study has Investigated an aspect of music as a disci- 
pline In Its attempt to develop a technique for Identifying children's con- 
cepts of certain musical elements. 

Interest In a conceptual approach to musical learnfcV-. is relatively 
recent. Although current literature relevant to the music ec..;itlon of chil- 
dren repeatedly cites the Importance of basic musical learning to the develop- 
ment of musical concepts (Chapter II), the curriculum for music In the ele- 
mentary school continues to be based primarily on subjective, experiential 
action and judgments rather than on a foundation of empirical evidence. 

The current Interest In the conceptual approach to music has re- 
vealed a gap between the opinions of leading music educators regarding curri- 
culum content and the Identification of a core of knowledge concerning the 
nature of children's concepts of music. A survey of the literature Identi- 
fied few research studies directly related to this topic. Yet It Is para- 
doxical to advocate teaching musical concepts If there Is little basic under- 
standing or Identification of the Ideas children possess regarding music. 

Thus the need for a technique to Identify children's concepts of music Is 
apparent. 



Psychologists have Indicated that the young child does possess many 
concepts, even prior to formal school experience. McDonald (32) writes, 

"The child who Is beginning school has already developed a system of concepts 
and characteristic ways of perceiving and organizing the stimuli from his 



environment" (p.l33). Russell (44) states that a child by the age of three 
or four knows hundreds of concepts (p. 229). Concepts are important to the 
learning process and learning behavior in that they are the foundation of 
problem-solving strategies. Each concept, as it develops, becomes the key- 
stone for further concept development; each concept also interacts with other 
concepts to form complexes and hierarchies of concepts. Concepts are the 
basis of discrimination, value judgment, and generalization. 

Accepting the importance of concepts in organizing learning behavior 
it follows that the child's musical growth will rely heavily on his understand 
ing of the organization and interaction of the structural elements of music. 
Little agreement is apparent among those who have written on the subject re- 
garding which musical concepts should be taught, or at what particular time 
(Chapter II). The basic attributes of any musical tone, however, include the 
dimensions of pitch, duration, loudness, and timbre (pp.5to7); the goal of 
the present study has been to develop a technique for identifying children's 
concepts of three of these dimensions or basic elements, pitch, duration, and 
loudness . 



Through the interrelationships of these elements and their numerous 
and complex extensions of melody, harmony, dynamics, and tonal color, the 
core of a multifaceted, spiral, cyclic music curriculum should evolve. A 
conceptual approach necessarily involves these basic elements. 



Purpose 



In view of the foregoing, the purpose of this study was to develop 
a technique for identifying elementary school children's concepts of pitch, 
duration, and loudness. This involved developing and evaluating measures of 
musical concepts. Past tests have dealt with areas of music aptitude, music 
achievement, and music appreciation, but none has explored the area of musical 
concepts, essentially cognitive in nature. It was believed that the proposed 
technique might reveal significant information regarding children's concepts 
of musical elements, and possess considerable potential as a practical meas- 
ure of childr^ 's growth in musical concept development. 



I 

Definitions 



In this study it is assumed that conceptual understanding of musical 
elements involves recognition of the basic elements of music in terms of their 
function wiihin the musical frame of reference. 

The basic elements of music are pitch, duration, loudness, and tim- 
bre. Their physical manifestations are frequency of vibration, duration of 
vibration, intensity or amplitude of vibration, and complexity of wave-form. 
The psychological dimensions of sound are closely related to the physical 
dimensions, although there is not a one-to-one relationship of frequency to 
pitch, duration of vibration to length of sound heard, amplitude or intensity 
to loudness, and the wave-form to timbre or tone-color. 
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The terin dimension as used in the present study refers to the musi- 
cal elements of pitch (higher/ lower) , duration (faster/slower, longer/shorter), 
and loudness (louder/softer). Because of the complexity of the problem of 
identifying timbre concepts, the dimension of timbre was not included. 

The term multidimensional frame of reference refers to the complex- 
ity of the musical sounds within which the subjects were asked to perceive 
changes or differences. In the Listening and Overt Measures, subjects were 
asked to make a judgment of the change within a single excerpt, or a judgment 
on the difference between two excerpts, within the complex sound of the sym- 
phony orchestra. The music utilized was characterized by variations in pitch, 
duration, loudness, and timbre, constituting the multidimensional frame of 
reference. 



The term predominant change refers to the most obvious or most 
prominent change in the music. Musical excerpts were selected with obvious 
or predominant changes restricted to one dimension or two dimensions. 

The Battery of Musical Concept Measures included four measures, 
each constituting a different stimulus-response combination. The Verbal 
Measure required written response to written (verbal) stimuli. The Listening 
Measure required written response to musical stimuli. The Manipulative Meas - 
ure required a manipulation of the stimulus material by the subject and a 
response by playing or indicating the correct answer. The Overt Measure re- 
quired both an overt -physical movement response and an oral response to musi- 
cal stimuli. The Verbal and Listening Measures, constructed for administra- 
tion in a classroom situation, were group measures . The Manipulative and 
Overt Measures were individual measures . 

Each of the four measurement modes measured the subject's concepts 
of the dimensions of pitch, duration, and loudness. The measures are referred 
to as balanced measures because they contained an equal number of items for 
the measurement of each dimension. 

The Verba"" and Listening Measures, designated as the Written Meas - 
ures , were combined to produce a written score for each dimension. The de- 
rived scores are discussed as Written Pitch , Written Duration , and Written 
Loudness . The Manipulative and Overt Measures, designated as the Nonwritten 
Measures , were combined to produce a nonwritten score for each dimension. 

The derived scores are discussed as Nonwritten Pitch , Nonwritten Duration , 
and Nonwritten Loudness . Phrases such as "verbal pitch," "manipulative loud- 
ness," or "overt duration" refer to the score derived from a single measure 
on one dimension. 

Music - making instruments , as used in the Manipulative Measure, were 
devices or contrived simple instruments on which musical sounds (i.e., sounds 
having perceivable pitch, duration, loudness, and timbre) could be produced 
easily by striking or plucking. In this study such devices included simple 
classroom rhythm instruments and bells, metal plates and metal tubes of 
various sizes, shapes, and densities. 
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Limitations 



The study was limited to the development of a technique for identi ~ 
fylnK elementary school children's musical concepts. No attempt was made to 
determine how children form musical concepts or to measure growth in musical 
concepts. Other limitations were: 

1. The study attempted to identify children's concepts of music in 
terms of the musical elements of pitch, duration, and loudness. Although 
timbre was identified as a musical element, concepts of timbre were not in- 
cluded in this study. 

2. Subjects used in the Pilot and Main Studies were limited to 
children enrolled in the fourth-grade classes of twelve Pennsylvania and one 
Maryland public school systems. Classes were randomly selected from schools 
reporting adequate space for the administration of the measures. Subjects 
were within the normal range of intelligence and hearing. 

3. The musical stimuli employed consisted of musical examples in- 
volving changes in the dimensions being measured. The symphony orchestra was 
the medium used for the musical examples in the Listening and Overt Measures. 
Other instrumental ensembles, solo instruments, and vocal music were excluded. 
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CHAPTER II 



REVIEW OF RELATED LITERATURE 



Background 



This chapter includes a discussion of the elements of music, a gen- 
eral discussion of concepts, a discussion of musical concepts, and a review 
of related studies. The discussion of elements of music is necessary to de- 
velop a meaningful presentation of the study; similarly, the term concepts 
requires an overview, since the researcher investigating an f 
conceptual learning or the identification of concepts is f ^ 

variety of schools of thought and viewpoints, as well as definitions. The 
review of related studies presents the results of an examination of the re- 
search relating to the present study. 



Elements of Music 



Increasing attention has been given in recent books and articles on 
music and music education to the development of musical concepts, but there 
is no general agreement on the identification of the musical elements to be 
emphasized for concept learning, or how concepts relate to such elements. 
Certain authors mention melody, harmony, and rhythm (Sheehy, 47; Leonhard and 
House, 30); other authors include these three, but add form or organization 
as a fourth element (Bergethon and Boardman, 3; Pflederer, 41; Hartshorn, 21, 
LaRue, 29; Monsour and Perry, 37). Ernst and Gary (15) discuss tnelody 
rhythm, harmony, and timbre as musical elements (pp. 17-39); but they inclu e, 
among desired musical outcomes, understanding of the component parts o£ 
music and the interrelationships that exist between melody, rhythm, hamony, 
and form" (p. 6). Hoffer (25) states; "In music there are concepts of 
rhythm, melody, harmony, dissonance, modulation, syncopation, phrase, , 

and maiy more" (p. 125). Woodruff (57) mentions concepts of fo^, rhythm, 
progression, harmony, counterpoint, and "many other precise musical 
(p. 226). Palisca (38) speaks of pitch and rhythm as well as timbre, dynam- 
iL, tempo, duration, form, and style (p. 9). Hemann (24) includes mood 
tone, rhythm, melody, harmony, and form as essential musical learnings, 
cording to Thompson (51) : 



In sound, motion, and design we have the true essentials of the 
musical language. . . . There is no music without these three at- 
tributes. ... The language of music is the language of melody, 
harmony, time values, rhythms, and musical form. All these are 
only variations on the theme of sound, motion, and design Cp. . 



The references cited to this point are characterized by lack of 
viable categorization in that certain writers, covering what is to them famil 
iar ground, either omit one or more of the major musical elements and rele- 
vant musical extensions, or list as elements certain musical phenomena that 
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characterize musical performance or derive from a musical extension o an 
element (e.g., phrase* mood) but are not in themselves an inherent part o 
music in its elemental state. Rather, they are part of the musicianship of 
the composer or performer. 

A different approach to the problem is taken by certain music educa- 
tors, experimental psychologists, and physicists who define the properties of 
tone, or the structural elements of sound, as the basic elements of ^usic. 
Among these authors the nomenclature applied to the various elements differs 
(for example, duration designated as time, loudness as intensity, timbre as 
tone quality or tone color), but there is agreement that a tone has proper- 
ties of pitch, duration, loudness and timbre. Schoen (46) states that all 
vibratory motion has the four properties of frequency, amplitude, a^d 

duration" (p. 5). Both Culver (13, p. 41) and Bartholomew (1, p. 6), writing 
on acoustics of music, name three respects in which musical sounds may differ: 
loudness, quality, and pitch. These are perceptual or sensory dimensions, 
dependent on physical manifestations of the sound wave (stimulus) as 
on the auditory mechanism of the receiver. According to Woodworth and 
Schlosberg (59); 



Loudness is primarily determined by stimulus intensity, even though 
the ear is much more sensitive to some frequencies than to others. 
Fitch is even more closely tied to stimulus frequency, with inten- 
sity having some minor effects. The auditory mechanism is an amaz- 
ingly efficient analyzer of both frequency and intensity in the im- 
portant middle ranges, even though less faithful at the extremes 
(p. 336). 



Woodworth and Schlosberg also explain the correspondence between the physical 
and sensory dimensions: 



To physical intensity corresponds the sensory dimension of loudness} 
to vibration frequency corresponds pitch} to the composition of a 
complex wave corresponds the timbre of a musical instrument and the 
vowel quality of speech (p. 324). 

Moles (36) presents three dimensions: 

A pure, isolated sound is a sonic entity defined by three dimensions, 
which may be physical: amplitude (pressure in baryes « dynes/cm ), 

frequency (cycles per second) , and length (seconds) } or they may be 
perceptual: loudness (decibels), pitch (octaves), and duration 

(log _t) (p. 12). 

Zuckerkandl (60) and Lundin (31) each mention five dimensions of 
tone. Zuckerkandl names pitch, loudness, duration, tone color, and volume} 
Lundin discusses pitch, loudness, timbre, volume, and density. 

Considering these and other opinions, the consensus seems to be 
that a musical tone has the qualities of pitch, duration, loudness, and timbre. 
In the words of Fleming and Veinus (16): "The wave frequency, or the number 

of vibrations per second of the tone, leads to the sensation of Fitch, its 
time length to duration , its energy to intensity Jl^ondnes ^ , and its overtone 
structure to timbre" (p. 5). 
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Bernstein (4) discusses relativity in the subjective measurement of 
tonal characteristics: 

The term pitch refers to the relative highness or lowness of a tone. 

. . . The duration of a musical tone may be defined purely in terras 
of the time elapsing between its commencement and its cessation. 

In actual practice the ear never measures the absolute time inter- 
val. It is, however, continually making comparisons between the 
relative time values of different tones, and between the time 
values of groups of tones. 

Intensity Is the measure of the loudness of a tone. As in the case 
of the two previously mentioned characteristics, our measurement of 
intensity is purely relative. . . . The loudness of a tone for any 
given pitch depends on the amplitude of the vibrations j i.e., the 
extent to which the sounding body moves from its normal position of 
rest as it vibrates to and fro (pp. 1-2). 

For the present study the four elements of pitch, duration, loudness, 
and timbre were accepted as the basic structural elements of music. The meas- 
ures developed, however, dealt directly with pitch, duration, and loudness; 
timbre was excluded from the study. 



Concepts 



An examination of the literature revealed many definitions for the 
term "concept." Woodruff (58) states; 

A concept is a relatively complete and meaningful idea in the mind 
of a person. It is an understanding of something. It is his own 
subjective product of his way of making meaning of things he has 
seen or otherwise perceived in his experiences. At its most con- 
crete level it is likely to be a mental image of some actual object 
or event that the person has seen. At its most abstract or complex 
level it is a synthesis of a number of conclusions he has drawn 
about his experiences with particular things (p. 2). 

Vinacke (54) defines concepts as "cognitive organizing systems which serve 
to bring pertinent features of past experience to bear upon a present 
stimulus-object"(p. 294). Harriman (19) states that a concept is a "mental 
activity which brings two or more situations, experiences, or objects into a 
relationship; also, the sum- total of past experiences brought to bear upon a 
given situation" (p. 80). Hunt (26) quotes Webster’s definition of a concept 
as a "mental image of a thing formed by a generalization from particulars; 
also, an idea of what a thing is to be in general" (p. 1). 

In connection with the consideration of concepts, the distinction 
between "concept" and "percept" should be discussed. Russell (44) indicates 
that a "percept is the awareness of present data rather than a memory or 
image of things past" (p. 66). He classifies a concept as a "more or less 
stable percept" (p. 68). 
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B6relson and Steiner (2) define perception as: 

The more complex process by which people select, organize, and 
interpret sensory stimulation into a meaningful and coherent pic- 
ture of the world. . . . sensation shades into perception as expe- 
rience goes from the isolated and simple to the complex interpreta- 
tions characteristic of normal, ongoing awareness of the world (p. 88). 

The implication is that^Jioncepts result from percepts, and are more stable. 
Russell (44)‘~fftat’es that "concepts develop out of related perceptual experi- 
ences" (p. 117); Smoke (50) refers to concept learning as the "process whereby 
an organism develops a symbolic response which is made to the members of a 
class of stimulus patterns but not to other stimuli" (p. 8). 

Royer (43) lists five phases of developing a concept: sorting, 

classification, ordering, abstraction, and generalization. Harris (20) states 
that the three functions which comprise the process of concept foraation are 
the perception, the abstraction, and the generalization (p. 5). Woodruff s 
(57) three phases of learning a concept are (1) perceptual, sense organs col- 
lect stimuli; (2) conceptual, the brain organizes the stimuli; (3) applica- 
tory, the concept is used (p. 221). 

Several studies indicate that a subject may possess a concept but 
may not be able to verbalize it. Heidbreder (23) found that "concepts were 
often used with consistent correctness though the subject was unable to formu- 
late them verbally" (p. 673). Hebb (22) states: 

A concept is not unitary. Its content may vary from one time to 
another, except for a central core whose activity may dominate in 
arousing the system as a whole. To this dominant core, in man, a 
verbal tag can be attached; but the tag is not essential. The con- 
cept can function without it . . . (p. 133) 

Russell (44) indicates that in some experiments children show a 
clear understanding of a concept but are unable to verbalize it (p. 118). 
However, Chan and Travers (10), in a study in which they showed ambiguous 
visual displays simultaneously with labels either descriptive of the display 
or irrelevant, found that "meaningfulness and relevancy or appropriateness 
of the label attached to the stimulus may be a factor facilitating perceptual 

learning" (p. 65). 

Concept formation involves comparing, discriminating, judging, or- 
ganizing, abstracting, and generalizing. Formation of concepts involves the 
application of previously acquired knowledge and understanding to new situa- 
tions, and the forming of useful and consistent inferences and conclusions. 

Differentiation between percepts and concepts is difficult, for 
percepts form the background of concepts, hunt (26) states that "neither the 
process nor situational differences between perception and conception are 
•all or none' distinctions" (p. 6). The lines between percepts and concepts 
are fine ones, and the learner crosses back and forth from one to the other 
in the process of concept formation. 
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If a hierarchy is involved in the formation of musical concepts, 
one can speculate that it is somewhat less than discrete. It may develop in 
such a manner as this: 

1. Any sound causes the listener to respond in terms of an aural 
sensation (e.g., dropping a book or closing a door is a noise stimulus caus- 
ing the individual hearing it to respond in terms of an aural sensation). 

2. Awareness of a different, highly organized sound, such as a 
musical sound, is a more complex and prolonged sensation. The listener must 
now be aware of a kind or classification of sound that possesses distinguish- 
ing characteristics from such sounds as those of the falling book or the clos- 
ing door. This awareness might be said to be limited initially to the percep- 
tion of a musical tone, a fragmentary stimulus. When perception of a musical 
tone, or a timbre, broadens to include that of many timbres (e.g., different 
voices and Instruments), or when the perception of musical movement broadens 
to include that of rhythmic movements, it is at least close to constituting a 
concept. 



3. Awareness and recognition of a musical entity, such as a song, 
a march, a dance tune, performed in a musical medium (e.g., solo violin, or- 
chestra, piano) as an organized unit. Involves many single discrimination 
percepts which interact to form higher order concepts. 

4. Awareness of the specific musical factors of duration, loudness, 
pitch, and timbre are conceptual functions when recognized in terms of their 
particular operational behavioral function in a musical framework. For pur- 
poses of this study, such a framework is referred to as multidimensional. 



Related Studies 



A review of related research revealed few studies directly pertinent 
to the Identification of children's basic musical concepts. 

Delhi (14) investigated musical concept development of college stu- 
dents through two subtests: basic structure concepts (verbal stimuli) and 

performance quality concepts (aural stimuli). Utilizing regression analysis, 
he found a low, nonsignificant correlation between musical concept develop- 
ment and amount of performance experience; amount was determined by weighted 
questionnaire. The subtest of structural concepts correlated nonsignlflcantly 
with the amount of experience while the subtest of performance concepts cor- 
related low but significantly with amount of experience. 

Il'ina (27) studied musically backward preschool children in an 
attempt to determine "which elements of a melody find reflection in the audi- 
tory conceptions of the child . . . and to what degree motor vocal reactions 
participate in the formation of these auditory Images." A conclusion was 
that motor reactions are important in differentiating auditory imprecisions 
(p. 710). 

Pflederer (40), in a pilot study, investigated certain responses of 
five- and eight-year-olds to certain musical tasks embodying the Piagetian 
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principle of conservation. Each task represented an hypothesis that develop- 
mental trends in the conservation of the specific musical concept (meter, tone 
rhythm) could be discerned through the administration of the task to children 
of different ages. In seven of the eight tasks the eight eight-year-old 
children were better able to conserve the concept in question than were the 
eight five-year-old children. A primary implication of this pilot study was 
that overt interaction of the child with the musical problem seemed to be 
essential. 



In a later paper Pflederer cited findings of Brehmer and Rupp, as 
reported by Franklin (17). Brehmer found tonal thinking of children to be 
dominated by the function of the musical configuration as a whole; young 
children cannot think in terms of abstractions that deal with the various 
parts of a total melodic shape. Rupp found that children before age eight are 
not yet ready to listen both horizontally and vertically at the same time. 

His subjects seemed to perceive only the melody in harmonized music. 

Simon (48), replicating Pflederer 's tasks in a further study with 
six- to nine-year-olds, found a general improvement in performance with in- 
creasing age. Comparing visual conceptual tasks involving seriation and area 
conservation with musical analogues, he found that the musical tasks were 
more difficult than their visual analogues. 

Simon found that pitch discrimination appeared extremely difficult, 
requiring not only the perceptual ability to discriminate between tones but 
also the conceptual understanding of the terms "high" and "low" in music. 
Auditory seriation was an almost impossible task for the large majority of 
subjects, even subjects who demonstrated a conceptual understanding of seria- 
tion in the visual domain. Only in the metric conservation task did any proc- 
esroFlogLal inference seem to be involved. Simon concluded that assessing 
the individual roles of perception and conception in the tasks is difficult. 

Boekelheide (5) designed measures to assess the listening skills of 
eight- and nine-year-olds in areas of sensitivity to rhythmic and melodic 
movement, aural recognition of like and unlike phrases, and aural recognition 
of changes in the pitch level of phrases. She found high correlations be- 
tween musical development, as measured by performance on her tests, and both 
general ability and academic achievement, as measured by standard intelligence 
and achievement tests. The mean scores obtained for the rhythmic response 
measure were higher than for the melodic contour and pitch discrimination 
tests in this battery. Boekelheide believed her tests identified the general 
range of low and high achievers in the listening skills but did not identify 
Individual musical characteristics of subjects. 

Williams (56) pointed out, among other difficulties that character- 
ize studies in pitch discrimination with young children, the problem that 
naming pitches as "high" and "low" is an arbitrary convention which must be 
learned. He used piano for demonstrating this relationship to four- and five- 
year-old children and found all children able to learn quickly and demonstrate 
at the piano relationships of "high" as "upstairs" and "low" as "downstairs.' 
At the conclusion of a short testing period during which children were asked 
to identify pitch changes in five patterns, however, some children were un- 
able to replicate the "upstairs-downstairs" piano demonstration. From a sub- 
sequent study of the ability of four-year-old children to sing songs, he con- 
cluded that, even after a year of daily training in singing, some children 
lacked pitch consciousness. 
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Williams (56) found evidence that "the concept of relative loudness 
has become stabilized in practically all normal children by the time they are 
four years of age, in many cases even younger" (p. 17). In a study which in- 
volved having children reproduce a periodic beat by tapping, Williams found 
failures decreasing from 75 per cent at age three to 4.1 per cent by age six, 
with no failures at ages seven and eight. Williams' studies seemed to indi- 
cate that ability to discriminate and reproduce loudness and rhythmic changes 
develops earlier than pitch consciousness. 

Sievers (56) used children from Grades 1 through 6 in a related 
study using Williams' tapping device. He asked children to reproduce, at 
three different speeds, two patterns: (1) a steady beat and (2) a longer 

note alternating with a shorter one of half the duration. He found that ac- 
curacy in reproducing the patterns varied with age of subjects and speed of 
beat. Younger children had difficulty reproducing either pattern accurately 
at slower speeds, with many children in the lower grades unable to reproduce 
the second pattern at any speed. He found an increasing per cent of correct 
responses through the first four grades, with all children above fourth grade 
correctly reproducing the patterns at any of the speeds used. 

In another study, using seven one-measure rhythmic patterns, Sievers 
found an irregular development of the ability to reproduce the patterns; this 
development correlated positively with age. He also found that correlations 
between chronological age and rhythmic ability were considerably higher than 
correlations between rhythmic ability and intelligence. He concluded that 
rhythmic performance seems to improve somewhat with age but not with intelli- 
gence. 



Riley and McKee (33), in a investigation of pitch and loudness 
transposition, found that of ninety-seven first-grade children only one failed 
to learn the loudness discrimination but twenty-four failed to learn the 
pitch discrimination with a 500-cycle difference. In a study with second- 
and third“grade subjects and adults (42), they found that second- and third- 
graders had difficulty in learning a 100-cycle pitch discrimination but 
easily learned the 500-cycle difference. All were able to learn the loudness 
task easily. Riley and McKee proposed that the more readily learned ampli- 
tude discrimination may be based on previously established mediating responses 
which the subjects bring to the experimental situation. They felt six-year- 
olds to be more familiar with the concepts "louder-softer" than with "higher- 
lower." 



Jeffrey (28) investigated differences in response mode for tonal 
frequency discrimination learning with f ive-and-one-half-year-old children. 

He found that the words "high" and "low" lacked meaning for those children 
when applied to musical tones. His results seemed to indicate that differen- 
tiation of widely-separated pitches was learned more successfully when the 
subjects responded vocally or by playing the note on the piano than when they 
merely pressed a button on the left or right, producing no sound. All sub- 
jects showed inability to transfer from the training interval of three oc- 
taves and a fifth to the smaller interval of a fifth. 

Bridges (7) studied the harmonic discrimination ability of children 
in Kindergarten through Grade 3. She found that with increasing age of sub- 
jects there was a gradual development in the ability to discriminate between 
appropriate and inappropriate harmonic accompaniments to songs. Another 
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finding was that the majority of children at any age are better able to dis- 
criminate harmonically with an unknown than with a known melody. These re- 
sults seem to indicate that children's musical perception is influenced by 
past experiences with music. 

Similar results were found in Ward's (55) study in the area of 
auditory perception. He concluded that individual differences in aural per- 
ception can result from a common stimulus , that aural perception originates 
within the hearer, and that threshold differences cause differences in per- 
ceptions of tone quality. 

In his studies of children's auditory perception of musical sounds, 
Petzold (39) found that children improve on all dimensions with age. His 
longitudinal studies revealed that most children reached a plateau in these 
rhythmic tasks at about third grade and showed no improvement thereafter. 

The plateau in pitch ability was reached by the end of fourth grade, in gen- 
eral, with many children unable to reach the criterion on phrase-reproducing 
tasks at any age. Petzold concludes that age is a significant factor in the 
development of auditory perception. Mainwaring (35), in a study with child- 
ren nine-and-one-half- to eleven-and-one-half-years-old, found that no con- 
sistent relationship exists between perception of pitch differences and per- 
ception of rhythmic patterns, and that age is an important factor in the de- 
velopment of cognitive abilities of this kind. 

Less directly related are studies by Bond and Simpson in the area 
of gross motor performance and locomotor response to rhythmic stimuli. Bond 
(6) computed correlations between scores from the Seashore Test of Rhythm, 
given in the written form and with an apparatus which presented the rhythmic 
patterns in aural, visual, and tactile modes, and scores on measures of motor 
performance and motor learning ability. She found no significant relation- 
ship between rhythmic perception and motor performance or motor learning 
ability. Simpson (49) used an electrical device with two spring-mounted 
platforms connected to a kymograph for objectively recording subject's loco- 
motor (foot movement) responses to various rhythmic stimuli. She found low 
correlation between scores on locomotor response and scores on the time dis- 
crimination and rhythm identification sections of the Kwalwasser-Dykema Music 
Tests. Both studies suggest that cognitive perception of rhythm is not sig- 
nificantly related to motor response to rhythm. 

In summary, it appears that very few studies in musical concepts, 
as measured in the present study, have been undertaken. This absence of 
direct related studies is graphically illustrated in Ittiie Encyclopedia of Edu - 
cational Research (45). In this comprehensive review of concept literature, 
no reference is made to studies of musical concepts. 
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CHAPTER III 



DEVELOPMENT OF BATTERY OF MUSICAL CONCEPT MEASURES 



Verbal Measure 



The Verbal Measure of eighteen multiple-choice items was developed 
for use in a group or classroom situation. Containing six items for measur- 
ing each of the dimensions (i.e., the elements of pitch, duration, and loud- 
ness), the measure involved a comparison and discrimination of natural or 
music-related sounds recalled from prior experience. This task required a 
minimal ability to read and comprehend, as well as a background of experience 
with natural and musical sounds. In the construction of the measure, a 
rigorous attempt was made to limit the comparisons to sounds and the reading 
to words the subjects were likely to have encountered in their everyday 
existence. 



The final form of the Verbal Measure evolved from several test-and- 
revise cycles between July of 1965 and August of 1966. During the summer of 
1965 approximately one hundred verbal items were devised, proposed, and evalu- 
ated by members of the research team. After considerable discussion of con- 
tent and form, the five-answer multiple-choice format was adopted. The use 
of "none of these" as an answer choice was excluded because of the possibility 
of confusing or frustrating subjects with its negative connotation. However, 
the answer "any of these" was included in some items because it seemed to 
allow subjects to demonstrate possession of the concepts. 



University Area Trial. August 1965. 



Purpose . The purpose of this exploratory trial was (1) to try the 
verbal items with elementary school pupils, (2) to discover the length of 
time needed for administering the measure, and (3) to estimate the verbal 
difficulties subjects might encounter with the items. 

Subjects. The subjects in this sample were twenty volunteers from 
communities near The Pennsylvania State University, about equally divided 
between children who would soon enter and those who had recently completed 
fourth grade. 

Materials and Procedures . The measure used in this trial included 
seventeen multiple-choice items". Subjects were instructed to mark one answer 
for each item, choosing the best answer from five choices. The items in- 
cluded in this form appear as Numbers 1 to 17 in Verbal Measure, Form 1 
(Appendix B) . This group required approximately fifteen minutes to complete 

the measure. 

Results, An analysis of results from this trial revealed a very 
high proportion of correct responses, scores ranging from nine to seventeen 
with a mean of 14.75 and a standard deviation of 2.35. 
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Discussion . The subjects gave little indication of difficulty in 
reading and comprehending the items. Because of the nature of the sample 
group, the results obtained in this trial were inconclusive and further 
trials with a more representative sample were projected. Six additional 
items were constructed to provide a pool of items adequate for choosing a 
balanced number of tested items measuring concepts of the three dimensions. 



Estella Trial*. September 1965 



Purpose. The purpose of this trial was (1) to try the items^in the 
Verbal Measure with a non-volunteer group of fourth-grade pupils and (2) to 
obtain data for statistical analysis. 

Subjects . The subjects were the entire fourth grade of thirty-two 
pupils^ in the Loyalsock Area Elementary School, Sullivan County School Dis- 
trict, Estella, Pennsylvania. The area is rural, sparsely-populated, and 
situated a considerable distance from any urban center. 

Materials and Procedures . The measure used in this trial was in 
two parts: Items 1 through 17 comprised one part. Items 18 through 23 a sec- 
ond part (Appendix B, Verbal Measure, Form 1). Because of interruptions in 
the school schedule, only the first part of the measure was administered dur- 
ing the first testing session, and the six additional items were completed in 
a later session the same day. Instructions were to mark the best answer for 
each item, marking only one answer for each. Subjects were instructed to re- 
quest help during the testing if they experienced difficulty reading any 
words. These words were then read to them individually by an examiner. 

Results. The mean score for the trial was 9.65, with a standard 
deviation of 3.86. The scores, divided into the two administration segments, 
showed a mean of 8.27, a standard deviation of 3.40, and a Kuder-Richardson 
reliability^ of .67 for the first seventeen items; a mean of 1.38 and a stand 
ard deviation of 1.26 for the last six items. An item analysis (Appendix A, 
Table LI) for the measure showed item difficulty ranging from .00 to .90 for 
this administration, with two items answered correctly by less than 10 per 
cent of the subjects and two items answered correctly by more than 65 per 
cent of the subjects. The discrimination indices ranged from .00 to .82, 
with two items showing values less than .39. 



^In this and subsequent trials, any discrepancies between the num- 
ber of subjects tested and the "N's" appearing on various statistical tables 
are due to deletion of subjects for one of these reasons: (1) known hearing 

defects, (2) complete data not available, (3) scores or other data not usable 
for a specific analysis, (4) failure to observe specified test procedure. 

^Robert L. Thorndike and Elizabeth Hagen, Measurement and Evaluation 
in Psychology and Education (second edition; New York: John Wiley and Sons, 

Ync., 1961), p. 181. The formula used here, and designated in subsequent 
computations as Kuder-Richardson reliability, is Kuder-Richardson Formula 21. 
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Discussion . As a result of this administration, the general format 
and methods of procedure were accepted for use with subsequent groups. Most 
of these subjects had no difficulty following directions for marking answers, 
and few indicated that they could not read or understand the items. With two 
exceptions, Items 4 and 21, the items discriminated well between the high and 
low scorers, and most items showed an acceptable level of difficulty. 

The problem which seemed most in need of further investigation was 
the apparent inconsistency in performance of individual pupils on the two 
segments of the test. In order to ascertain whether the discontinuous test- 
ing procedure contributed to the inconsistent performance on the items, ad- 
ministration to another sample was planned. 



Central Dauphin Trial, October 1965 



Purpose . The purpose of this trial was (1) to test the twenty- three 
items of the Verbal Measure as a unit in a single, continuous session and (2) 
to obtain additional information on the difficulty and discriminating power 
of the individual items before revising the measure for the Pilot Study. 

Subjects . The subjects were twenty-four pupils in one fourth-grade 
classroom in the Phillips School, Central Dauphin School District, Harrisburg, 
Pennsylvania. This district is part of a large, urban area. 

Materials and Procedures . The measuring instrument for this admin- 
istration was identical to that used in the Estella trial, but was adminis- 
tered in one session (Appendix B, Verbal Measure, Form 1). The administrator 
in this trial provided opportunity for pupils to ask questions concerning 
procedure. 



Results . The mean score for this trial was 10.96, with a standard 
deviation of 4.02, and a Kuder-Richardson reliability of .67. For the first 
seventeen items a mean of 9.46, a standard deviation of 2.9, and a reliability 
of .53 were computed. The final six items showed a mean of 1.5, a standard 
deviation of 1.1, and a reliability of .09. 

An item analysis (Appendix A, Table LII) showed three items answered 
correctly by less than 20 per cent and four items answered correctly by more 
than 80 per cent of subjects. Discrimination indices showed two items below 
.15, three items between .16 and .30, and all remaining items between .30 and 

.85. 



An item analysis computed from the combined scores of the Central 
Dauphin and Estella trials (Appendix A, Table LIII) showed three items an- 
swered correctly by less than 20 per cent and two items answered correctly by 
more than 80 per cent of the subjects. Three items had discrimination in- 
dices below .20, three items were between .25 and .35, and all remaining items 
had discrimination indices from .40 to .79. 

Discussion . A comparison of the scores of the two samples on the 
final six items of the Verbal Measure revealed a very small difference in the 
means of 1.5 and 1.38. This was indication that the items themselves, rather 
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than the testing conditions, were responsible for the difference in the scores 
on the two segments of the measure. 

As a result of the item analysis, six items were discarded as non- 
discriminating or as inappropriate in difficulty for fourth-grade pupils. 

Items were revised or rewritten and one new item constructed to provide the 
needed quota of six items involving concepts of each dimension within the 
balanced measure of eighteen multiple-choice items. Answer choices for each 
item were checked to ascertain which had been functioning effectively as de- 
coys. Alternatives not chosen by subjects were deleted and new choices sub- 
stituted. The eighteen revised items were ordered randomly, a sample item 
included, and written instructions for marking answers developed and added, 
completing the preparation of the measure for the Pilot Study (Appendix B, 
Verbal Measure, Form 2). 



Pilot Study, January 1966 



Purpose . The purpose of this administration was (1) to examine the 
items and procedure for the administration of the measure using a random sam- 
ple of fourth-grade pupils and (2) to obtain data for statistical analysis. 

Subjects . The subjects were fifty-six pupils in two randomly- 
selected fourth-grade classrooms, one rural and one urban, in the Washington 
County, Maryland, schools. 

Materials and Procedures . The instrument used in this administra- 
tion was Verbal Measure, Form 2, as described. The administrator read aloud 
the directions and the sample item. After pupils had checked an answer 
choice, they were told the correct answer. Pupiis were then given an oppor- 
tunity to ask questions relating to procedure. When all questions were an- 
swered, pupils continued with the eighteen items. If individual subjects 
indicated difficulty in reading words in any item, the words were read to them 
by a member of the research team. The total time used to complete the items 
was approximately twenty minutes, although some subjects finished in consid- 
erably less time. 

Results . Means, standard deviations, and a Kuder-Richardson reli- 
ability estimate were computed for the complete measure and for the separate 
dimensions. Table I shows these results for the entire group participating 
in the classroom testing of the Written Measures (N = 56) and also for the 
randomly-selected sample who completed all four measures (N = 38). 

An item analysis (Table II) showed item difficulty ranging from .26 
to .89, with one item answered correctly by less than 30 per cent of subjects 
and two items answered correctly by more than 80 per cent of subjects. Dis- 
crimination indices ranged from .36 to .81, with only one item below .40. 



This district televises elementary music instruction from a central 
location to elementary schools in the system. 
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Discussion . On the basis of item analysis data, items were revised 
to increase' or decrease item difficulty where this deviated substantially 
from .50. Because several of the answer choices appeared not to represent 
easily- identifiable sounds to some subjects, as indicated by the frequent 
selection of certain incorrect answers, two items were deleted. New items 
were devised to replace these; changes were made in the answer choices for 
Item 1, and non-functioning decoys were altered on several other items. All 
items were checked and rechecked for clarity of expression and exactness of 
meaning by members of the research team and by a group of experienced music 
educators, graduate students at The Pennsylvania State University, during the 
summer of 1966. The agreement of the research team and the music educators 
on the items was accepted as evidence of content validity. 



TABLE I 

MEANS, STANDARD DEVIATIONS, AND RELIABILITY 
ESTIMATES FOR THE VERBAL MEASURE 
PILOT STUDY 



Dimension 


Number 
of items 


N = 56 




N = 38 




Mean 


S.D. 


’^ll* 


Mean 


S.D. 


r ** 

’^ll 


Pitch 


6 


3.67 


1.77 


.722 


4.16 


1.65 


.639 


Duration 


6 


3.82 


1.31 


.318 


3.92 


1.57 


.534 


Loudness 


6 


3.53 


1.58 


.495 


3.97 


1.42 


.404 


Total 


18 


10.89 


4.15 


.794 


12.05 


4.01 


.817 



*Kuder -Richard son reliability estimate 
**Strata fixed reliability estimate from RELIB, a computer program 
at The Pennsylvania State University (Appendix E) 



In preparing the final form of the Verbal Measure (Appendix C, Ver- 
bal Measure, Main Study), a standardized wording was used for similar items 
(e.g.. Items 3, 7, 8, and 14) to minimize reading and comprehension difficul- 
ties. Punctuation was standardized, and critical words in several stems were 
underlined to increase the probability that they would be noticed by subjects. 
This form of the measure was accepted for use in the Main Study. 



Listening Measure 



The Listening Measure was developed to measure the subject's ability 
to identify changes in the dimensions of pitch, duration, and loudness within 
the multidimensional frame of reference of orchestral music. In the final 
form it consisted of eighteen short musical items, from four to twenty sec- 
onds in length, excerpted from standard orchestral literature. 
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*Chung-Teh Fan, Item Analysis Table (Princeton, New Jersey: Educational 

Testing Service, 1952). 



These items were of three types: 



1. In the first group of six items, each contained a predominant 
change in one dimension, requiring a single judgment of the predominant change 
within the musical example, e.g., faster, lower, softer. 

2. In the second group of six items, each contained predominant 
changes in two dimensions, requiring a judgment of two changes within the 
example, e.g., higher and louder, faster and softer, lower and slower. 

3. In the third group of six items, each pair of musical excerpts 
differed predominantly in one dimension, requiring judgment of the predomi- 
nant manner in which the second excerpt differed from the first. 

In scoring the final form, items in Groups 1 and 3 received one 
point eachj items in Group 2 received two points each. 

In developing the Listening Measure, musical items (excerpts from 
larger works) were identified as definite examples of change in a single 
dimension or in two dimensions. These excerpts were analyzed and discussed 
as possible items, and decisions made to retain them as acceptable or to 
reject them as ambiguous or as too easy or too difficult. Twenty-four trial 
items were finally accepted by consensus of the research team. These items 
constituted the preliminary measure administered to trial groups at The Penn- 
sylvania State University. 



Exploratory Trial, August 1965 



Purpose . The purpose of this exploratory trial was (1) to discover 
a usable format and procedure and (2) to obtain an objective indication of 
agreement on the predominant changes in the excerpts selected for trial. 

Subjects . The sample consisted of twelve music educators in a 
graduate course in music education at The Pennsylvania State University (des- 
ignated as Mu Ed 574 in Table LIV) . 

Materials and Procedures . The twenty-four items (musical excerpts) 
and one sample item, in taped form, were used for this trial. Answer sheets 
were in the form of numbered blanks on which subjects wrote free-response- 
type answers (Appendix B, Listening Measure, Form 1). Subjects were ill- 
structed to listen to the example and describe the musical change. 

Results. In scoring this measure, each item received one point. 
Scores ranged from eleven to twenty- three, with a mean of 18.1 and a standard 
deviation of 3.77. The percentage of correct answers for twenty of these 
items appears in Table LIV (Appendix A) . 

Discussion. Responses indicated a general misunderstanding of the 
type of judgment desired. To reduce the procedural difficulties, more ex- 
plicit instructions were developed and a multiple-choice answer format de- 
vised. A revised trial measure was developed, eliminating four suspect items 
which had elicited a variety of responses. The revised taped measure of 
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twenty items included an additional sample item and revised spoken directions, 
identical with those on the answer form. 



University Groups Trials, August 1965 



Purpose . The purpose of the trials was (1) to test the revised 
format and procedure and (2) to obtain evidence of agreement or disagreement 
on predominant musical changes in the excerpts from subjects who were not 
music educators. 

Subjects . The two groups of subjects from The Pennsylvania State 
University were: 

1. Eighteen elementary classroom teachers and prospective classroom 
teachers in an undergraduate course in methods for teaching elementary music 
(designated as Mu Ed 86 dn Table LIV) . 

2. Twenty-eight high school students in the chorus of the Summer 
Band-Orchestra-Chorus School (designated as BOG Chorus in Table LIV). 

Materials and Procedures . The revised instrument, including twenty 
items, two sample items, and directions for procedure, was used in these two 
trials, with a copy of the multiple-choice format answer sheet provided for 
each subject. 

Results . The range of correct responses on the twenty items for 
Mu Ed 86 was from six to twenty, with a mean score of 15.7 and a standard 
deviation of 3.22; scores for BOC Chorus ranged from thirteen to twenty, with 
a mean of 17.43 and a standard deviation of 1,72. The percentage of each of 
the three trial groups (Mu Ed 574, Mu Ed 86, and BOC Chorus) correctly answer- 
ing each item appears in Table LIV (Appendix A), A correlation coefficient 
(Pearson's r) of ,804 was computed between the scores of Mu Ed 86 and BOC 
Chorus . 



Discussion . A comparison of scores from Mu Ed 86 and BOC Chorus 
indicated that, although the percentage of the BOC Chorus answering correctly 
was generally higher, the overall performance of the two groups showed con- 
siderable agreement on the relative difficulty of the items. Examination of 
the percentages of all three groups revealed that a substantial majority of 
each group chose the correct answers. This agreement indicated that the pre- 
dominant change in these items was discernible. 

The discrepancy between the scores of the Mu Ed 574 sample and the 
other two groups was a matter of concern. Results seemed to indicate that, 
in addition to the possible effects of change in procedure and a different 
method of answering, the group of music educators may have been noticing as- 
pects of the music differing from those apparent to the two less experienced 
groups . 

Using the results of these first three trials as a guide to usable 
content and format, a revised set of items was prepared for a trial with ele- 
mentary school pupils. Items showing a relatively low percentage of correct 
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answers were omitted, and new items judged acceptable by consensus of the re- 
search team were added to produce a total of twenty-four items. A decision 
was made to remove a stimulus variable by using only orchestral examples. To 
achieve homogeneity of medium, non-orchestral excerpts and orchestral exam- 
ples containing solo passages were deleted, and all Items subsequently chosen 
utilized the orchestral medium. 



University Area Trial, August 1965 



Purpose . The purpose of this exploratory trial was (1) to test 
items and format with elementary school subjects and (2) to identify proce- 
dural difficulties and non-usable Items. 

Subjects . The subjects in this sample were twenty volunteers from 
communities near The Pennsylvania State University, about equally divided be- 
tween children who would soon enter and those who had recently completed 
fourth grade. 

Materials and Procedures . The measure used in this trial consisted 
of twenty-four items. It included a tape containing twenty-four musical ex- 
cerpts, spoken directions, sample excerpts, and spoken citation of item num- 
bers, as well as answer sheets with directions for marking answers (as spoken 
on the tape) and five answer choices for each Item (Appendix B, Listening 
Measure, Form 3). Subjects were instructed to listen to each item, choose 
the best answer, then listen again before marking the answer. 

Results . The results of this trial revealed a high proportion of 
correct answers, scores ranging from eleven to twenty-two, with a mean of 
17.4 and a standard deviation of 3.38. The results of an Item analysis for 
this trial 'appear In Table LV (Appendix A). An examination of this data 
showed that only three of the twenty-four items were answered incorrectly by 
more than 50 per cent of the subjects. 

Discussion . In evaluating the results of this trial, the research 
team considered a number of possible reasons for the high mean. Among these 
were: 



1. The non-random technique used in recruiting subjects biased the 

sample. 

2. The presence in the sample of a substantial proportion of chil- 
dren entering fifth grade Inflated the mean. 

3. The measure in this form was too easy for fourth-grade children. 

To further investigate these possibilities, it was decided to obtain 
a more representative sample of fourth-grade children for another trial. 

After careful review of the procedural difficulties encountered dur- 
ing the several early trials of the Listening Measure, a new tape was devel- 
oped Including revised directions and additional sample Items. It was decided 
that the final form of the Listening Measure would consist of eighteen items, 
six requiring a judgment of a single change within each excerpt, six requiring 
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a iudgment of two concurrent changes within each excerpt, and six r ng 

iudgment of a single difference between the two examples in a pair of excerpts. 
The first and third groups were to contain one item for each of six possi e 
changes; higher, lower, louder, softer, faster, slower. For the second 
group, items with two changes, six of the possible answer combinations were 
Llected randomly, within the requirement that resulting combinations provide 
an equal number of answers for each dimension (i.e., pitch, duration, loud- 
ness). In this third trial instrument, however, the full twenty- four items 
from the previous trial were included to provide a larger number of tested 
items for analysis and to provide a basis for comparison of scores from the 

two trials. 



Estella Trial. September 1965 



Purpose. The purpose of this trial was (1) to test the listening 
items with a non-volunteer group of fourth-grade children and (2) to obtain 
data for statistical analysis. 

Subiects. The subjects were the entire fourth grade of thirty-two 
pupils in the Loyalsock Area Elementary School, Sullivan County School Dis- 
trict, Estella, Pennsylvania. 



Mnfcer tals and Procedures . The measure used for this administration 
was the form revised after the preceding trial (Appendix B, Listening Measure, 
Form 3). The twenty-four items were unchanged, but additional sample items 
and directions were added. In the administration, the experimenter gave gen- 
eral directions in addition to the instructions on the tape, and assisted 
pupils with the sample items. 



Results. The mean score for this trial was 9.50 correct responses, 
with a standard deviation of 4.84 and a Kuder-Richardson reliability estimate 
of .787. Results of an item analysis showed item difficulty ranging from 
.05 to .70 and discrimination indices ranging from -.27 to .93. With the 
exception of four items, all were .39 or above in discrimination (Appendix A, 

Table LVI) . 

Discussion. The mean score for this sample was substantially lower 
than the 17.4 obtained in the University Area sample. This may have been due 
in part to the non-select nature of the Estella group, which included all 
pupils in the fourth grade, to the difference in maturity between fourth and 
fifth graders, to the difference between the two groups in musical background 
and general cultural envirovmient, or to a combination of these factors. A 
third trial was scheduled using an urban sample to further test the items. 



Central Dauphin Trial. October 1965, 



Purpose . The purpose of this trial was (1) to test the items in 
the Listening Measure with a second non-volunteer group of fourth-grade sub- 
jects and (2) to obtain additional information on the difficulty and discrimi- 
nating power of the individual items before revision of the measure for the 

Pilot Study. 
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Sub.jects . The subjects were twenty-four pupils in one fourth-grade 
classroom in the Phillips School, Central Dauphin School District, Harrisburg, 
Pennsylvania. 

Materials and Procedures . The instrument and procedures for this 
trial were identical to those used in the Estella trial. 

Results . The mean score for this trial was 11.96, with a standard 
deviation of 4.46 and a Kuder-Richardson reliability estimate of .73. Scores 
from this sample and the Estella trial were combined to provide a larger 
sample. The mean score for the combined sample was 10.59, with a standard 
deviation of 4.79 and a Kuder-Richardson reliability estimate of .77 (Appen- 
dix A, Table LVII) . An item analysis was computed for the combined data 
(Appendix A, Table LVIII) . 

Discussion . The means from the Estella and Central Dauphin trials 
indicated that the Listening Measure, as a whole, was of appropriate diffi- 
culty for fourth-grade children. The results of the item analysis were used 
in selecting items for use in the Pilot Study; those items showing a low 
discrimination index or a high difficulty level were not included in the re- 
vised form of this measure. 

Using the best items from this trial measure, as determined by the 
item analysis, and adding new ones as necessary, a balanced measure of eight- 
een items was constructed. This measure contained six single excerpts re- 
quiring single answers, six single excerpts requiring double answers, and six 
pairs of excerpts for comparison requiring single answers. Because of the 
six double-answer items, the total possible score on the measure was twenty- 
four. The measure was balanced with eight items involving pitch concepts, 
eight duration concepts, and eight loudness concepts (Appendix B, Listening 
Measure, Form 4). 



Pilot Study, January 1966 



Purpose . The purpose of this administration was (1) to examine the 
items and procedure for the administration of the measure using a random sam- 
ple of fourth-grade pupils and (2) to obtain data for statistical analysis. 

Subjects . The subjects were fifty-six pupils in two randomly- 
selected fourth-grade classrooms, one rural and one urban, in the Washington 
County, Maryland, schools. 

Materials and Procedures . The measure used in this trial consisted 
of eighteen items. It Included a tape containing eighteen musical excerpts, 
spoken directions, sample excerpts, and spoken citation of item numbers, as 
well as answer sheets in five-alternative multiple-choice form. (The musical 
source of items, the directions for administration, and an answer sheet appear 
in Appendix B, Listening Measure, Form 4.) Subjects were encouraged to ask 
questions concerning the procedure after each set of instructions and sample 
items was completed. The testing time was approximately twenty-five minutes, 
including time for directions and sample items. 
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Results . Means, standard deviations, and a Kuder-Richardson relia- 
bility estimate were computed for the complete measure and for the separate 
dimensions. Table III shows these results for the entire group participating 
in the classroom testing of the Written Measures (N = 56) and for the randomly- 
selected sample completing all four measures (N = 38). 



TABLE III 

MEANS, STANDARD DEVIATIONS, AND RELIABILITY 
ESTIMATES FOR THE LISTENING MEASURE 
PILOT STUDY 



Number 

Dimension of items 


N = 56 


N = 38 


Mean S.D. 


Mean S.D. r^^** 


Pitch 8 
Duration 8 
Loudness 8 


4.32 2.23 .69 
5.04 1.92 .56 
5.40 1.87 .57 


5.05 2.32 .76 
5.47 1.75 .60 
6.13 1.14 .34 



*Kuder-Richardson reliability estimate 
**Strata fixed reliability estimate from RELIB, a computer program 
at The Pennsylvania State University (Appendix E) 



An item analysis (Table IV) showed item difficulty ranging from .26 
to .85, with one item answered correctly by less than 30 per cent of subjects 
and one answered correctly by more than 80 per cent of subjects. Discrimina- 
tion indices ranged from .21 to .89. 

Discussion . In preparing the Listening Measure for the Main Study, 
items were examined with reference to the results of the Pilot Study item 
analysis. It was decided (1) to delete any items with a discrimination index 
less than .30 and (2) to examine items with a proportion of correct answerff 
smaller than .30 or greater than .75 for possible replacement. 

After revisions were made, a trial instrument was administered to 
seventeen music educators during the summer of 1966 for preliminary valida- 
tion (Appendix B, Answer Sheet, Preliminary Validation). With the results of 
this procedure and the Pilot Study as a guide, three of the items were re- 
placed and the directions were revised. It was also decided that more time 
was needed between items, and that ten seconds was sufficient to give subjects 
time to read answer choices, decide, and mark the correct answer. (Revised 
directions appear in Appendix C, Listening Measure Administration, Main Study.) 

The spoken direction, "Mark your answer for number was included on the 

tape after the second playing of each item. 

The final form of the Listening Measure (Appendix C, Listening Meas- 
ure, Main Study) was prepared with the cooperation of the Division of Instruc- 
tional Services of The Pennsylvania State University. 
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Validation Trial. August 1966 



Purpose. The purpose of this trial was (1) to test the revised 
answer-sheet format and the procedure for administration and (2) to ascertain 
whether the Listening Measure items selected for the Main Study would elicit 
correct judgments of a musical change under test conditions. 

Subjects . The subjects for this trial were forty-five high school 
students participating in the Summer Music Clinic at The Pennsylvania State 
University. 

Materials and Procedures . The Listening Measure in the form to be 
used in the Main Study, consisting of five sample items and eighteen test 
items, was used for this trial. Answer sheets and administration procedures 
followed the revised forms (Appendix C, Listening Measure, Main Study). 

Results . Table V shows the percentage of students correctly answer- 
ing the items of the measure. 



table V 

PERCENTAGE OF CORRECT ANSWERS TO LISTENING ITEMS 



HIGH SCHOOL STUDENTS 
N = 45 


Item Per cent 

number correct 


Item Per cent 

number correct 


Item Per cent 

number correct 


1 100.0 

2 95.6 

3 97.8 

4 82.2 

5 100.0 

6 100.0 


7 95.6 

8 100.0 

9 300.0 

10 100.0 

11 100.0 

12 84.4 


13 93.3 

14 97.8 

15 91.1 

16 77.8 

17 97.8 

18 88.9 



Discussion . A high proportion of correct answers was achieved by 
this group. The results indicated that the items did change predominantly in 
the specified dimensions, and the listeners could perceive these changes 
under conditons similar to classroom testing. This was accepted as evidence 
of content validity. Instructions, content, and format of answer sheet 
worked satisfactorily with the group. A decision was made to utilize the 
Listening Measure in this form (items, answer sheets, directions, and proce- 
dures) for the Main Study. 
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Manipulative Measure 



The Manipulative Measure was developed for use with individual sub- 
jects and provided a mode of demonstrating understanding of the musical con- 
cepts of pitch, duration, and loudness through the manipulation of simple 
music-making devices (e.g., triangle, resonator bells, finger cymbal). Items 
were devised to measure these concepts, and promising items were selected by 
consensus of the research team. 

After several trials and revisions, the final form of the Manipula- 
tive Measure contained eighteen items, six items involving each of the dimen- 
sions of pitch, duration, and loudness. Items were designed so that the 
manual dexterity required for performing each test item was minimal. 



University Area Trial. August 1965 



Purpose. The purpose of this exploratory trial was (1) to try the 
items devised and estimate their effectiveness and (2) to try the procedure 
for the administration of the measure. 

Subjects. The subjects were eleven volunteer children, from commu- 
nities near The Pennsylvania State University, who would soon enter or had 
recently completed fourth grade. 

Materials and Procedures. The measure used in this trial con- 
sisted of fifteen items. Subjects were encouraged to experiment and become 
familiar with the music-making devices. Each task was explained by the ex- 
perimenter, and the subject attempted to perform the required task (Appendix 
B, Manipulative Measure, Form 1). One experimenter administered the measure 
to individual subjects and judged the responses right or wrong for a possible 
total of fifteen points; other experimenters observed the administration of 

the measure. 

Results. The results of this trial showed a high proportion of ^ 
correct answers, with scores ranging from nine to fifteen, a mean of 13*27, 
and a standard deviation of 1.77. Items were examined and several were dis- 
carded because of failure to elicit responses from the subjects. 

Discussion. New items were devised, examined, and incorporated into 
the measure, which was then balanced with six items measuring pitch, six dura- 
tion, and six loudness concepts, a total of eighteen items (Appendix B, Mani- 
pulative Measure, Form 2). The procedure described above seemed to be effec- 
tive in permitting subjects to demonstrate possession of the concepts and was 
accepted for subsequent use. Since the subjects in the first trial had been 
volunteers, further trials with more representative subjects were planned. 



Estella Trial, September 1965 



Purpose . The purpose of this trial was (1) to test the items with 
a non-volunteer group of fourth-grade children and (2) to provide data for 
statistical analysis. 

Subjects . The subjects were the entire fourth-grade of thirty-two 
pupils in the Loyalsock Area Elementary School, Sullivan County School Dis- 
trict, Estella, Pennsylvania. 

Materials and Procedures . The measure used in this trial consisted 
of eighteen items (Appendix B, Manipulative Measure, Form 2). Time for ad- 
ministration of the measure varied from ten to fifteen minutes. The items 
were judged right or wrong, with two experimenters alternately judging and 
observing. 



Results . Results of this trial showed a mean of 12.58, a standard 
deviation of 2.92, and a .Kuder-Richardson reliability estimate of .574. An 
item analysis (Appendix A, Table LIX) showed that loudness items had discrimi- 
nation indices ranging from .00 to .72; pitch items had indices ranging from 
.63 to .93; duration items had indices ranging from .00 to .82; and all items 
showed item difficulty ranging from .24 to 1.00. 

Discussion . This trial provided the research team with an evalua- 
tion of the items in the measure. It was decided to delete Item 7 and de- 
crease from six to four the number of resonator bells used in Items 13 through 
18; these items had proved too difficult. New items were developed and incor- 
porated into a revised measure (Appendix B, Manipulative Measure, Form 3). 



Bellefonte Area Trial, October 1965 



Purpose . The purpose of this trial was to test the effectiveness 
of the items prior to their incorporation into the measure for the Pilot 
Study. 



Subjects . The subjects were twenty randomly-selected pupils from 
a fourth-grade classroom in the Bellefonte Elementary School, Bellefonte Area 
School District, Bellefonte, Pennsylvania. 

Materials and Procedures . The measure used in this trial consisted 
of eighteen items, six measuring pitch, six duration, and six loudness con- 
cepts (Appendix B, Manipulative Measure, Form 3). Time for administration of 
the measure varied from ten to fifteen minutes. The experimenter judged the 
responses right or wrong. 

Results . The results of this trial showed a mean of 11.3, a stand- 
ard deviation of 2.44, and a Kuder-Richardson reliability estimate of .31. 
Item analysis (Appendix A, Table LX) indicated that most of the items dis- 
criminated with this group. 
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Discussion . Since the item analysis indicated that most of the 
items discriminated with this group, the Manipulative Measure, Form 3, was 
accepted for the Pilot Study. 



Pilot Study, January 1966 



Purpose . The purpose of the Pilot Study was (1) to examine the 
items and the procedure for the administration of the measure using a random 
sample of fourth-grade pupils and (2) to obtain data for statistical analysis. 

Subjects . The subjects were thirty-eight randomly-selected pupils 
from two randomly-selected fourth-grade classrooms in two schools, one rural 
and one urban, in the Washington County, Maryland, School District. 

Materials and Procedures . The measure consisted of eighteen items 
with six items measuring pitch, six duration, and six loudness concepts (Ap- 
pendix B, Manipulative Measure, Form 3). Subjects were encouraged to experi- 
ment and become familiar with the music-making devices. After each task was 
explained by the experimenter, the subject attempted to perform the required 
task. Items were judged right or wrong for a possible total of eighteen 
points. Time for administration of the measure varied from ten to fifteen 
minutes. 



Results . The means, standard deviations, and Kuder-Richardson reli- 
ability estimates of the Pilot Study are shown in Table VI. 



TABLE VI 

MEANS, STANDARD DEVIATIONS, AND RELIABILITY 
ESTIMATES FOR THE MANIPULATIVE MEASURE 
PILOT STUDY 
N = 38 



Dimension 


Number 
of items 


Mean 


Standard 

deviation 


Reliability 

estimate* 


Pitch 


6 


3.08 


1.95 


.872 


Duration 


6 


4.90 


1.29 


.535 


Loudness 


6 


5.74 


.50 


.03 


Total 


18 


13.71 


3.09 


.696 



*Strata fixed reliability estimate from RELIB, a computer program 
at The Pennsylvania State University (Appendix E) 



Item analysis (Table VII) showed the pitch items had discrimination 
indices ranging from .62 to .88, duration items had indices ranging from .39 
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*Fan table 



r 
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to .84, loudness items had indices ranging from .00 to .53, and all items 
showed item difficulty ranging from .26 to .85. 

Discussion . The procedure described was satisfactory in permitting 
subjects to demonstrate possession of the concepts. The item analysis indi- 
cated that the loudness items and two of the duration items had unacceptable 
difficulty and discrimination indices. Procedures for the revision of these 
items included: 

1. Changing the directions for Item 11 from "Which one is loudest?" 
to "Play from the loudest to the softest." 

2. Changing the directions for Item 13 from "Find the softest 
position." to "Find the softest, medium, and loudest positions." 

3. Changing the directions for Item 15 from "Which set is loudest?" 
to "Play in order from softest to loudest." 

4. Changing the directions for Item 17 from "Which is softest?" to 
"Play in order from loudest to softest." 

5. Re-examining the chord organ for a possible loudness item. 

6. Re-examining all items to confirm the presence of at least two 
other dimensions to function as decoys for the dimension being measured. 

Items were re-ordered and this revised measure (Appendix B, Manipu- 
lative Measure, Form 4) was submitted to a field trial. 



Matternville Trial, May 1966 



Purpose . The purpose of this trial was to examine the revised items. 

Subjects . The subjects were eleven randomly-selected fourth-grade 
pupils from a classroom in the Matternville Elementary School, Matternville, 
Pennsylvania. 

Materials and Procedures . The measure used in this trial contained 
eighteen items with six items involving concepts of pitch, six duration, and 
six loudness (Appendix B, Manipulative Measure, Form 4). The responses to the 
items were judged either right or wrong for a possible total of eighteen 
points. Subjects were encouraged to experiment and become familiar with the 
music-making devices. After each task was explained by the experimenter, the 
subject attempted to perform the required task. Time for administration of 
the measure varied from ten to fifteen minutes. 

Results . The means, standard deviations, and Kuder-Richardson reli- 
ability estimates for the total measure and the dimensions of pitch, duration, 
and loudness are shown in Table VIII. 

Discussion . An examination of the results indicated that the loud- 
ness items needed further revision. Items 7 , 11, and 13 were deleted from 
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the measure, and the consideration of new loudness items continued. The Dyna- 
level, an electronic apparatus measuring discrete variations in the loudness 
of sounds, was used to estimate the loudness levels of various music-making 
devices. On this basis, five items were tentatively accepted for possible 
incorporation into the measure. 



TABLE VIII 

MEANS, STANDARD DEVIATIONS, AND RELIABILITY 
ESTIMATES FOR THE MANIPULATIVE MEASURE 
MATTERNVILLE TRIAL 
N = 11 





Dimension 


Number 
of items 


Mean 


Standard 

deviation 


Reliability 

estimate* 



Pitch 


6 


2.91 


1.31 


.334 


Duration 


6 


4.36 


1.61 


.687 


Loudness 


6 


5.36 


.64 


.193 


Total 


18 


12.64 


2.71 


‘.649 



*Kuder-Richardson reliability 



Validation Trials, August 1966 



The five new loudness items were submitted for validation to eleven 
faculty members and graduate students in music education (a jury of experts) 
at The Pennsylvania State University. The criterion for acceptance was agree- 
ment of at least nine of the eleven judges. As a result of this procedure, 
three items were accepted and incorporated into the measure. 

The revised measure was then submitted to a second group of experts 
for validation. Results of the validation showed there was unanimous agree- 
ment by the experts on each item; this was accepted as content validity. The 
measure was accepted for the Main Study (Appendix C, Manipulative Measure, 

Main Study) . 



Overt Measure 



The Overt Measure was developed to provide individual subjects with 
a mode of demonstrating understanding of the musical concepts of pitch, dura- 
tion and loudness by responding with bodily movements to selected excerpts 
from orchestral literature. Musical examples were identified as items con- 
taining a predominant change in the musical dimensions being measured. These 
excerpts were analyzed and chosen by consensus of the research team and in- 
corporated into the measure. 
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In the final fora the Overt Measure consisted of nine musical ex- 
cerpts ranging from thirteen to twenty-five seconds in length. Subjects were 
judged for any change of movement in response to the predominant change in 
the music and for an oral response to the change in the music. 



University Area Trial, August 1965 



Purpose. The purpose of this exploratory trial was (1) to examine 
several possible procedures for the administration of the measure and (2) to 
determine the difficulty and effectiveness of the items. 

Subjects. Subjects were eleven volunteer children from the area 
near The Pennsylvania State University who would soon enter or had recently 
completed fourth grade. 

Materials and Procedures . The measure contained seventeen excerpts 
(Appendix B, Overt Measure, Fora 1). Various procedures and different experi- 
menters were used to elicit responses from the subjects. Among the procedures 
were: 



1. The subject listened to the first playing of the item and moved 
to it on the second playing. Although this basic method was used for all 
subjects, some subjects heard the excerpt an additional time because of fail- 
ure to respond on the second playing. 

2. The subject observed while the experimenter demonstrated possi- 
ble movements, showing a change where the music changed. The subject was 
then encouraged to move to the music. 

3; Beginning with the fourth subject, the first six items were 
omitted and the measure initiated with Item 7, which was more rhythmic and 
had an impelling effect on the movements of. the subjects. 

Movement was observed to determine (1) the presence of change in 
movement with the change in the music, and (2) the type of movement exhibited 
by the subject, e.g. smooth to jerky, fast to slow, etc. 

Results . Due to the exploratory nature of this trial, in which 
various procedures were employed, statistical analyses were not appropriate. 

Discussion. Items 1 through 6 were deleted because these items did 
not elicit movement from the subjects. A revision of the measure was devel- 
oped using Items 7 through 17 (Appendix B, Overt Measure, Fora 2). Subjects 
were now to be judged on changing movement when the music changed (one point) 
and on the appropriateness of movement to the change in the music (one point). 
The procedure in which the subject listened to the excerpt on the first play- 
ing and moved to the excerpt on the second playing was adopted for subsequent 

use . 
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Estella Trial, September 1965 



Purpose . The purpose of this trial was to examine the difficulty 
and effectiveness of the items with fourth-grade pupils. 

Subjects . The subjects were an entire fourth grade of thirty-two 
pupils in the Loyalsock Elementary School, Sullivan County School District, 
Estella, Pennsylvania. 

Materials and Procedures . The measure used in this trial consisted 
of eleven items, each item receiving two points for a possible total of 
twenty-two points (Appendix B, Overt Measure, Form 2). Subjects were judged 
on any change of movement to the predominant change in the music (one point) 
and on appropriateness of movement to this change (one point). Subjects were 
instructed to listen to the music on the first playing and move to the music 
on the second playing. 

Results. The results showed a mean of 13.0, a standard deviation 
of 2.60, and a Kuder-Richardson reliability of .217. An item analysis (Ap- 
pendix A, Table LXI) showed that all but one of the items had discrimination 
indices ranging from .63 to .93 and difficulty indices ranging from .37 to 

.75. 



Discussion . Although the item analysis was favorable, it was not 
certain whether the response changes resulted from a change in the style of 
music or from a change in pitch, duration, or loudness. New items were iden- 
tified and selected with particular attention to the problem of a single 
change in the music. The number of items in this revised measure was limited 
to nine because seventeen items had proved too fatiguing and time consuming. 
The measure (Appendix B, Overt Measure, Form 3) was balanced with three items 
measuring pitch, three duration, and three loudness concepts. 

A classroom orientation was to be included prior to the administra- 
tion of the measure to acquaint the subjects with possible physical responses. 
In this orientation a sample item would be played and verbal responses elic- 
ited from the subjects as to possible movement. 



Bellefonte Trial, October 1965 



Purpose . The purpose of this trial was to examine the judging of 
appropriateness of the overt response. 

Subjects . The subjects were twenty randomly-selected pupils from a 
fourth-grade classroom in the Bellefonte Elementary School, Bellefonte Area 
School District, Bellefonte, Pennsylvania. 

Materials and Procedures . The measure used in this trial contained 
nine items, each item receiving two points for a possible total of eighteen. 
Subjects were judged on any change of movement in response to the predominant 
change in the music (one point) and on appropriateness of movement to this 
change (one point). Subjects were instructed to listen to the music on the 
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first playing and move to the music on the second playing. Three judges rated 
certain subjects; two judges rated all twenty. 

Results . The test for significant differences among judges, com- 
puted by an analysis of variance, yielded an F ratio of 1.19 (not significant 
at the .05 level) for the two judges, and an F of 3.58 (significant at the 
.05 level) for the three judges. 

Discussion . Since the lack of agreement on the appropriateness of 
change contributed to the variability among judges, the judgment of this re- 
sponse was deleted. It was proposed that an oral response be substituted for 
the judgment of appropriateness. In this response the subject would be asked 
to indicate verbally what change occurred in the music. The six possible 
choices (higher, lower, louder, softer, faster, slower) would appear on a 
card shown to the subject. The proposed procedure for the administration was; 

1. Listen to the excerpt. 

2. Move to the music, changing when the music changes. 

3. Listen again, refer to the card if necessary, and tell what 
change occurred. 

Items 4, 6, and 7 were deleted from the measure because changes in 
these excerpts occurred gradually rather than at a definite point. Many ex- 
cerpts in which the music had a definite point of change were examined and 
three new items accepted. A trial with another sample was planned with this 
revised measure (Appendix B, Overt Measure, Form 4). 



Boalsburg Trial, November 1965 



Purpose. The purpose of this trial was to examine the effective- 
ness of the oral response in relation to the overt response. 

Subjects . The subjects were eight randomly-selected fourth-grade 
pupils from a classroom in the Boalsburg Elementary School, Boalsburg, Penn 
sylvanla. 



Materials and Procedures . The measure used in this trial contained 
nine items with each item receiving two points for a possible total of eight- 
een points. The subjects were judged on any change of movement to the pre- 
dominant change in the music (one point) and for the correct oral response 
(one point). Subjects were instructed to (1) listen to the excerpt, (2) move 
to the music, changing when the music changes, and (3) listen again, refer to 
the card if necessary, and tell what change occurred. 

Results. The results showed a mean of 5.4 and a standard deviation 
of .86 for changing when the music changed, and a mean of 4.25 and a standard 
deviation of 2.44 for the oral response. 

Discussion. The results of this trial indicated that the oral re- 
sponse worked satisfactorily with this group. The oral response was then 
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incorporated into the measure and the revised measure accepted for the Pilot 
Study (Appendix B, Overt Measure, Form 4). 



Pilot Study, January 1966 



Purpose . The purpose of this administration was (1) to examine the 
items and the procedure for the administration of the measure using a random 
sample of fourth-grade pupils and (2) to obtain data for statistical analysis. 

Subjects . The subjects were thirty-eight randomly-selected pupils 
from two randomly-selected fourth-grade classrooms in two schools, one rural 
and one urban, in the Washington County, Maryland, School District. 

Materials and Procedures . The measure contained nine items, each 
item receiving two points for a possible total of eighteen points. The sub- 
jects were judged on any. change of movement to the predominant change in the 
music (one point) and for the correct oral response (one point). Subjects 
were instructed to (1) listen to the excerpt, (2) move to the music, changing 
when the music changes, (3) listen again, refer to the card if necessary, and 
tell what change occurred. A classroom orientation to the measure was given 
at the completion of the group measures. In this procedure, pupils listened 
to a recorded example and suggested possible physical movements and the cor- 
rect oral response. 

Results . The means, standard deviations, and reliability estimates 
for the total measure and the dimensions of pitch, duration, and loudness are 
shown in Table IX, 



TABLE IX 



MEANS, STANDARD DEVIATIONS, AND RELIABILITY 
ESTIMATES FOR THE OVERT MEASURE 
PILOT STUDY 
N - 38 





Dimension 


Number 
of items 


Mean 


Standard 

deviation 


Reliabiltiy 

estimate* 



Pitch 


6 


1.90 


1.50 


.485 


Duration 


6 


4.37 


1.62 


.644 


Loudness 


6 


3.76 


1.24 


.080 


Total 


18 


10.03 


3.61 


.479 



^Strata fixed reliability estimate from RELIB, a computer program 
at The Pennsylvania State University (Appendix E) 
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An item analysis (Table X) showed that the discrimination indices 
ranged from .36 to .80 and difficulty indices ranged from .20 to .85. Kuder 
Richardson indices of agreement between the overt response and the oral re- 
sponse for the dimensions measured were: pitch, -.03; duration, .54; and 

loudness, .05. 



Discussion . Although indices of agreement indicated that there was 
no significant relationship between the physical response and the oral re- 
sponse for the dimensions of pitch and loudness, the procedure using both 
overt and oral responses worked satisfactorily with these subjects. After an 
examination of the item analysis. Item 6 was deleted; a new item was developed 
and incorporated into the measure. A validation of the revised measure was 
planned. 



Validation Trial, August 1966 



The musical excerpts in the Overt Measure were submitted to seven- 
teen graduate students in music education at The Pennsylvania State Univer- 
sity for validation by experts. The results of this v'^lidation. Table XI, 
show that a high proportion of the experts agreed on the musical changes in 
the excerpts and the measure was accepted as having content validity. 



TABLE XI 

RESULTS OF VALIDATING TRIAL 
OVERT MEASURE 
N = 17 





Item number 


Correct response 


Number responding correctly 



1 


softer 


16 


2 


faster 


17 


3 


slower 


16 


4 


louder 


17 


5 


slower 


17 


6 


higher 


14 


7 


lower 


17 


8 


louder 


17 


9 


higher 


14 



The final form of the Overt Measure, containing nine excerpts rang- 
ing from thirteen to twenty- five seconds in length (Appendix C, Overt Measure, 
Main Study), was prepared with the assistance of the Division of Instructional 
Services of The Pennsylvania State University. 
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Written and Nonwritten Measures 



It was decided in December of 1965, after extensive consideration 
and consultation, to derive six scores from the Battery of Musical Concept 
Measures. The plan was to combine the pitch, duration, and loudness scores 
from the Verbal and Listening Measures by using stanines to produce derived 
scores for Written Pitch, Written Duration, and Written Loudness. The pitch, 
duration, and loudness scores from the Manipulative and Overt Measures were 
to be combined by using stanines to produce Nonwritten Pitch, Nonwritten 
Duration, and Nonwritten Loudness. Written scores would have a possible 
total of fourteen points and nonwritten scores a possible total of twelve 
points (Table XII) . 

Data from the Pilot Study were analyzed in terms of the written 
and nonwritten derived scores. Table XIII shows the means and standard de- 
viations for the raw scores derived from the separate measures and the written 
and nonwritten derived scores. 

Before combining the verbal and listening scores into a written 
score and combining the manipulative and overt scores into a nonwritten^ 
score, it was necessary to ascertain whether the means and standard devia- 
tions from the separate measures were equal for each dimension. The scores 
in Table XIII show that the means and standard deviations of the scores for 
the different dimensions, as derived from the measures of the battery, were 
not equal (e.g., the means and standard deviations for verbal pitch are not 
equal to those for listening pitch). Therefore, in order to have comparable 
scores for obtaining the written and nonwritten scores, it was decided to 
convert the derived scores from the separate measures for each dimension into 
a standard score. The stanine was the type of standard score selected, chosen 
to avoid a false aura of accuracy. 

Stratified reliability estimates were obtained from the computer 
program RELIB, which gives a complete analysis of variance reliability (Ap- 
pendix E). Table XIV shows the strata fixed reliability estimate, strata 
j-gliabilities , ratio of subjects by strata mean square to residual mean 
square, and correlations between strata for the Pilot Study sample as derived 
from this computer program. 

The reliability estimates of all the measures, as indicated in 
Table XIV, are good except for Nonwritten Loudness, which shows low reliabil- 
ities for the Manipulative and Overt Measures. Strata exist for all written 
and nonwritten derived scores for the dimensions except Written Duration, 
which may indicate listening and verbal measurements of duration are not con- 
tributing different information. 

The computer program COREL (Appendix E) , used with the thirty-four 
students for whom complete data were available, produced the matrix in Table 

XV. 



Table XV presents reliability and validity estimates. The under- 
lined numbers are validity estimates, all significantly different from zero. 
Also included in this table are correlations of the written and nonwritten 
derived scores with IQ. Each of the written and nonwritten derived scores 
for the dimensions of pitch, duration? and loudness, except Nonwri-tten Loud- 
ness, correlates about .50 with IQ. Written Pitch, Written Loudness, and 
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Nonwritten Loudness showed correlations with IQ lower than their validity 
coefficients . 

The Written and Nonwritten Measures generally showed a favorable 
analysis, which warranted use of this design in the Main Study. 
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DESIGN AND DEVELOPMENT OF THE BATTERY OF 
MUSICAL CONCEPT MEASURES 
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CHAPTER IV 



MAIN STUDY 



Purpose 



The purpose of the Main Study administration was to obtain data for 
statistical analyses including (1) reliability estimates for the four meas- 
ures and for the derived scores of Written and Nonwritten Pitch, Duration, 
and Loudness, (2) item analysis of the several measures, (3) correlation co- 
efficients and partial correlations between measures, derived scores, and 
scores from standard intelligence and reading achievement tests, and (4) con- 
struction of stanine conversion tables for the derived scores. 



Subjects 



Twelve Pennsylvania school districts, situated in a geographical 
area of approximately 23,000 square miles, were invited to participate in the 
Main Study. All twelve school districts, which represented various socio- 
economic levels, agreed to participate. Administrators and music supervisors 
from the school districts were invited to the University Park Campus in May, 
1966, to meet with the research team prior to the administration of the meas- 
ures. They were briefed on the purposes and procedures of the project. Fol- 
lowing this conference, information sheets and questionnaires were sent to 
the school districts (Appendix D). 

On the basis of information received, one or two fourth-grade class- 
rooms were randomly selected in each school district from buildings with ade- 
quate testing facilities, making a total of sixteen classrooms. Names of 
schools participating in the Main Study and the number of subjects from each 
school appear in Table XVI. 

The group measures (Verbal and Listening) were administered to the 
entire classroom, following which fifteen or more pupils were randomly se- 
lected from the classroom sample. The individual measures (Overt and Mani- 
pulative) were administered to these pupils in the order of their selection. 

In most instances fewer than fifteen pupils were tested individually because 
of the time limitations of the school schedule. 



Materials and Procedures 



The Battery of Musical Concept Measures was administered in two 
parts as follows: (1) Vex'bal and Listening Measures were given to entire 

classroom groups and (2) Manipulative and Overt Measures were given individu- 
ally to randomly-selected subjects in separate rooms. A general orientation 
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to the procedure and personnel preceded the group testing session. After be- 
ing introduced to the class, the research team established a relaxed but coop- 
erative atmosphere in the testing situation, telling subjects that the infor- 
mation was for research purposes and would not affect their grades in music. 
The administrator also explained that the measures were an attempt to find out 
what children knew and thought about music, and what they heard when listening 
to music. Children were supplied with pencils and erasers, and other physical 
arrangements were checked before beginning the Verbal Measure. 

The Verbal Measure consisted of eighteen multiple-choice items plus 
one sample item. The form included printed directions for marking answers 
(Appendix C). The administrator read the printed directions aloud and helped 
children complete the sample item. Children were then to ask questions if 
they did not understand the procedure, and they were instructed to raise 

questions if difficulty was encountered reading test items. When all pre- 

liminary questions were answered, the group proceeded with the eighteen items. 
When a child indicated difficulty reading words in any item, he was assisted 
by a member of the research team. The total time to complete the items was 

approximately twenty minutes, although many children finished in less time. 

The Listening Measure of eighteen items was administered next using 
a recording of the music excerpts. Subjects marked answers on a multiple- 
choice answer sheet. (See Appendix C for musical source of examples Included, 
the answer sheet used by subjects, and directions fcr administration.) The 
measure was divided into three sections, with instructions and sample items 
for each section. Children were encouraged to ask questions concerning the 
procedure after each set of Instructions. Including the time for directions 
ai-\d sample items, the testing time was approximately twenty-five minutes. 

The Manipulative Measure of eighteen items was administered indivi- 
dually to children who were randomly selected from the classroom samples. 

(See Appendix C for method of administration and a copy of test items.) The 
administrator asked the child to play various music-making instruments in 
different combinations and make judgments concerning the pitch, duration, or 
loudness of the sounds produced. Each of the eighteen judgments was rated 
right or wrong. Time for administration of the measure varied from ten to 
fifteen minutes. 

The Overt Measure was the final measure in order of administration. 
It contained nine musical examples to which subjects responded in two ways: 

(1) by overt-physical movement, showing a change in movement when the music 
changed, and (2) by an oral response telling how the music changed. A gen- 
eral classroom orientation to the Overt Measure was included in the group- 
testing session, after the Verbal and Listening Measures were completed; in- 
dividual subjects were again given complete Instructions immediately prior to 
beginning the measure. (See Appendix C for classroom orientation, method of 
administration, copy of musical sources, judges' answer form, and stimulus 
card for oral answers.) The subject heard 2 ach of the items three times. He 



^Beginning with the third school system tested, a chart of seven 
words on which children most frequently requested help was read for and by 
the children before the administration of the measure (Appendix C). 
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was instructed to (1) listen to the music the first time and decide how he 
could move to it, (2) move with the music on the second hearing, showing a 
change where the music changed, and (3) listen again, refer to the card if 
necessary, and tell what change occurred. The administrator checked the an- 
swer form, giving one point for change of movement at the proper time and one 
point for the correct oral answer. Testing time was approximately fifteen 
minutes per subject. 



Results 



Reliability 

The means and standard deviations of the Verbal, Listening, Manipu- 
lative, and Overt Measures appear in Table XVII. Since the Verbal and Listen- 
ing Measures were administered to the total sample of 429 subjects, means and 
standard deviations are presented for both the total sample of 429 and the 
random sample of 214 subjects. It should be noted that the means and standard 
deviations for the Listening Measure were computed on the eighteen items, with 
each item, including those with double answers, receiving one point. In 
other analyses dividing the Listening Measure into scores for pitch, duration, 
and loudness (Tables XVIII, XIX, and XX), double items were scored for two 
dimensions, producing eight points for each dimension for a total possible 
score of twenty-four. 



TABLE XVII 



MEANS AND STANDARD DEVIATIONS 
FOR THE FOUR MEASURES 
MAIN STUDY 





Measure 


Number 
of items 


Number of 
subjects 


Mean 


Standard 

deviation 



Verbal 


18 


214 


8.39 


3.54 


Verbal 


18 


429 


8.26 


3.51 


Listening 


18 


214 


9.43 


3.80 


Listening 


18 


429 


9.18 


3.76 


Manipulative 


18 


214 


10.98 


2.94 


Overt 


18 


214 


10.43 


2.93 



The means and standard deviations of the derived scores for the di- 
mensions of pitch, duration, and loudness, as measured by the four separate 
measures, appear in Table XVIII. The means and standard deviations for the 
dimensions of pitch, duration, and loudness, derived from the combined Writ- 
ten Measures and Nonwritten Measures, appear in Table XIX. 



48 



TABLE XVIII 



MEANS AND STANDARD DEVIATIONS FOR THE DERIVED 
SCORES FROM THE FOUR MEASURES 
MAIN STUDY 
N - 214 



Number 

Measure of items 


Dimension 


Pitch 


Duration 


Loudness 


Mean S.D. 


Mean S.D. 


Mean S.D. 


Verbal 6 
Listening 8 
Manipulative 6 
Overt 6 


2.486 1.400 
3.818 2.122 
2.276 1.720 
1.930 1.346 


3.206 1.599 
5.313 1.772 
4.154 1.360 
4.439 1.247 


2.706 1.460 
5. .’9 1.561 
4.551 1.104 
4.065 1.341 



TABLE XIX 



MEANS AND STANDARD DEVIATIONS OF THE WRITTEN 
AND NONWRITTEN DERIVED SCORES 
MAIN STUDY 
N - 214 




Derived 

score 



Number Standard 

of items Mean deviation 



Written Pitch 
Nonwritten Pitch 
Written Duration 
Nonwritten Duration 
Written Loudness 
Nonwritten Loudness 



14 


6.31 


3.01 


12 


4.21 


2.42 


14 


8.52 


2.84 


12 


8.59 


2.06 


14 


8.21 


2.44 


12 


8.62 


1.79 



The reliability estimates, ratio of subject by strata mean square 
to residual mean square, correlations among strata, and strata reliabilities 
were computed for the four measures using an analysis of variance computer 
program, RELIB (Appendix E). These data appear in Table XX. 

Table XX shows that the reliability estimates range from .644 to 
.846, with the Written Measures yielding the highest reliability estimates. 
Most of the strata reliabilities are substantial; however, the strata reli- 
abilities for the Listening Measure are higher than those for the other three 
measures. The ratio of subjects by strata mean square to residual mean square 
(test of the hypothesis that strata do not exist) was not found to be signifi- 
cant for the Listening Measure. Further evidence of nonexistent strata in 
the Listening Measure is shown by item correlations between strata as high as 
item correlations within strata. The correlations among strata range from 
.07 to .45 for the remaining measures, and each correlation, except the corre- 
lation between Manipulative Duration and Loudness, is significantly different 
from zero. 



Table XXI presents reliability estimates for written and nonwritten 
derived scores, strata reliabilities, ratio of subject by strata mean square 
to residual mean square, and correlations among strata, computed by a com- 
puter analysis of variance program, RELIB (Appendix E) . 

An examination of the ratio of subjects by strata mean square shows 
that strata exist for each of the derived scores. This ratio indicates that 
the Verbal and Listening Measures are separate measures contributing differ- 
ent information to the written derived scores. This is also true of the 
Overt and Manipulative Measures. 

The correlations among strata, ranging from .322 to .426, are fur- 
» ther evidence that the Verbal and Listening Measures contribute different in- 
formation to the written derived scores. The Manipulative and Overt Measures, 
with correlations ranging from .061 to .246, contribute different informatibn 
to the nonwritten derived scores. 



Item Analysis 



An item analysis was computed for each measure, using a library 
computer program ITANL (Appendix E). Tables XXII to XXXIII present the item 
analyses for the four measures (Verbal, Listening, Manipulative, and Overt) 
and for the written and nonwritten derived scores for the dimensions of pitch, 
duration, and loudness. These include the following information: per cent 

of subjects answering an item correctly (item difficulty index), item-test 
correlation (item discrimination index), and the dimension measured. 

The item analyses for the Verbal and Listening Measures were com- 
puted for both the random sample of 214 subjects and for the total Sample of 
429; therefore two item analysis tables appear for each of these measures 
(Tables XXII to XXV). Since the Manipulative and Overt Measures were admin- 
istered only to the random sample of 214, a single item analysis appears for 
each measure (Tables XXVI and XXVII). 
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TABLE XXII 



VERBAL MEASURE ITEM ANALYSIS 
MAIN STUDY 
N = 214 



Item 


Per cent of students 
answering correctly 


Item-test 

correlation 


Dimension 

measured 


1 


.720 


.379 


L 


2 


.350 


.371 


P 


3 


.360 


.484 


D 


4 


.617 


.587 


D 


5 


.589 


.544 


D 


6 


.322 


.629 


L 


7 


.318 


.498 


L 


8 


.439 


.522 


D 


9 


.164 


.381 


P 


10 


.593 


.571 


D 


11 


.519 


.518 


L 


12 


.463 


.648 


L 


13 


.407 


.373 


P 


14 


.364 


.481 


L 


15 


.579 


.561 


P 


16 


.612 


.583 


D 


17 


.570 


.527 


P 


18 


.421 


.666 


P 
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TABLE XXIII 



VERBAL MEASURE ITEM ANALYSIS 
MAIN STUDY 
N = 429 





Item 


Per cent of students 
answering correctly 


Item-test 

correlation 


Dimension 

measured 





1 


.718 


.427 


L 


2 


.366 


.390 


P 


3 


.406 


.474 


D 


4 


.599 


.554 


D 


5 


.543 


.589 


D 


6 


.336 


.503 


L 


7 


.275 


.398 


L 


8 


.466 


.512 


D 


9 


.177 


.488 


P 


10 


.534 


.505 


D 


11 


.541 


.541 


L 


12 


.417 


.641 


L 


13 


.417 


.408 


P 


14 


.368 


.486 


L 


15 


.541 


.569 


P 


16 


.606 


.552 


D 


17 


.564 


.568 


P 


18 


.389 


.649 


P 
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TABLE XXIV 



LISTENING MEASURE ITEM ANALYSIS 
MAIN STUDY 
N « 214 



Item 


Per cent of students 
answering correctly 


Item-test 

correlation 


Dimension 

measured 


1 


.500 


.599 


L 


2 


.533 


.758 


P 


3 


.542 


.739 


P 


4 


.369 


.590 


L 


5 


.360 


.742 


D 


6 


.855 


.483 


D 


7 


.556 


.641 


D + L 


8 


.607 


.582 


P + L 


9 


.626 


.430 


D + L 


10 


.393 


.863 


P + D 


11 


.621 


.678 


P + D 


12 


.313 


.613 


P + L 


13 


.897 


.571 


D 


14 


.463 


.365 


P 


15 


.575 


.450 


L 


16 


.164 


.295 


P 


17 


.626 


.423 


L 


18 


.458 


.518 


D 



TABLE XXV 



LISTENING MEASURE ITEM ANALYSIS 
MAIN STUDY 
N « 429 



Item 


Per cent of students 
answering correctly 


Item- test 
correlation 


Dimension 

measured 


1 


.487 


.665 


L 


2 


.462 


.755 


P 


3 


.473 


.694 


P 


4 


.357 


.584 


L 


5 


.368 


.749 


D 


6 


.862 


.441 


D 


7 


.557 


.675 


D + L 


8 


.594 


.639 


P + L 


9 


.615 


.427 


D + L 


10 


.368 


.872 


P + D 


11 


.590 


.678 


P + D 


12 


.331 


.605 


P + L 


13 


.893 


.465 


D 


14 


.410 


.340 


P 


15 


.594 


.392 


L 


16 


. 156 


.301 


P 


17 


.646 


.366 


L 


18 


.431 


.554 


D 
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table XXVI 



MANIPULATIVE MEASURE ITEM ANALYSIS 
MAIN STUDY 
N » 214 



Item 


Per cent of students 
answering correctly 


Item-test 

correlation 


Dimension 

measured 


1 


.477 


.464 


D 


2 


.612 


.522 


D 


3 


.308 


.671 


P 


4 


.280 


.600 


P 


5 


.780 


.399 


L 


6 


.748 


.396 


L 


7 


.570 


.626 


P 


8 


.579 


.722 


P 


9 


.612 


.555 


D 


10 


.813 


.419 


D 


11 


.813 


.333 


L 


12 


.794 


.310 


L 


13 


.285 


.665 


P 


14 


.252 


.654 


P 


15 


.757 


.399 


D 


16 


.879 


.398 


D 


17 


.505 


.249 


L 


18 


.911 


.347 


L 



table XXVII 



OVERT MEASURE ITEM ANALYSIS 
MAIN STUDY 
N » 214 



Per cent of students 
Item answering correctly 


Item-test 

correlation 


Dimension 

measured 


1 


.832 


.450 


L 


2 


.860 


.691 


D 


3 


.893 


.576 


D 


4 


.505 


.576 


L 


5 


.855 


.628 


D 


6 


.178 


.441 


P 


7 


.145 


.316 


P 


8 


.706 


.572 


L 


9 


.542 


.710 


P 


10 


.617 


.516 


L 


11 


.850 


.365 


D 


12 


.491 


.570 


D 


13 


.766 


.444 


L 


14 


.491 


.398 


D 


15 


.369 


.340 


P 


16 


.449 


.517 


P 


17 


.640 


.428 


L 


18 


.248 


.493 


P 


Note; Items 1-9 
Items 10- ! 


are Overt Responses 
18 are Oral Responses 


to the same musical examples 



TABLE XXVIII 



WRITTEN PITCH ITEM ANALYSIS 
MAIN STUDY 
N - 214 





Item 


Per cent of students 
answering correctly 


Item-test 

correlation 


Original 
item number 



1 


.350 


.379 


2 


2 


.164 


.484 


9 


3 


.411 


.386 


13 


4 


.579 


.657 


15 


5 


.565 


.491 


17 


6 


.421 


.528 


18 


7 


.533 


.861 


2 


8 


.542 


.771 


3 


9 


.659 


.533 


8 


10 


.467 


.855 


10 


11 


.640 


.640 


11 


12 


.346 


.713 


12 


13 


.463 


.408 


14 


14 


.168 


.261 


16 



Note: Items 1-6 are from Verbal Measure 

Items 7-14 are from Listening Measure 



TABLE XXIX 



WRITTEN DURATION ITEM ANALYSIS 
MAIN STUDY 
N = 214 



Per cent of students Item-test Original 

Item answering correctly correlation item number 



1 


.360 


2 


.617 


3 


.589 


4 


.439 


5 


.593 


6 


.612 


7 


.360 


8 


.855 


9 


.640 


10 


.701 


11 


.598 


12 


.790 


13 


.897 


14 


.458 



.507 


3 


.661 


4 


.532 


5 


.582 


8 


.599 


10 


.529 


16 


.669 


5 


.433 


6 


.540 


7 


.444 


9 


.670 


10 


. 616 


11 


.563 


13 


.620 


18 



Items 1-6 are from Verbal Measure 
Items 7-14 are from Listening Measure 



Note; 



TABLE XXX 



WRITTEN LOUDNESS ITEM ANALYSIS 
MAIN STUDY 
N = 214 



Item 


Per cent of students 
answering correctly 


Item-test 

correlation 


Original 
item number 


1 


.720 


.428 


1 


2 


.322 


.532 


6 


3 


.322 


.612 


7 


4 


.519 


.601 


11 


5 


.458 


.482 


12 


6 


.364 


.459 


14 


7 


.500 


.725 


1 


8 


.369 


.542 


4 


9 


.888 


.584 


7 


10 


.925 


.402 


8 


11 


.734 


.567 


9 


12 


.874 


.162 


12 


13 


.575 


.496 


15 


14 


.621 


.500 


17 



Note; Items 1-6 are from Verbal Measure 

Items 7-14 are from Listening Measure 



TABLE XXXI 

NONWRITTEN PITCH ITEM ANALYSIS 
MAIN STUDY 
N = 214 





Item 


Per cent of students 
answering correctly 


Item-test 

correlation 


Original 
item number 



1 


.308 


.606 


3 


2 


.280 


.577 


4 


3 


.575 


.686 


7 


4 


.579 


.721 


8 


5 


.285 


.650 


13 


6 


.252 


.792 


14 


7 


.178 


.380 


6 


8 


.369 


.334 


7 


9 


.145 


.390 


9 


10 


.449 


.624 


15 


11 


.537 


.461 


16 


12 


.252 


.661 


18 



Note: Items 1-6 are from Manipulative Measure 

Items 7-12 are from Overt Measure 



A 
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TABLE XXXII 



NONWRITTEN DURATION ITEM ANALYSIS 
MAIN STUDY 
N = 214 



Per cent of students 
Item answering correctly 



Item-test 

correlation 



Original 
item number 



1 


.477 


2 


.612 


3 


.612 


4 


.813 


5 


.757 


6 


.879 


7 


.860 


8 


.850 


9 


.888 


10 


.491 


11 


.855 


12 


.495 



.615 


1 


.596 


2 


.514 


9 


.463 


10 


.590 


15 


.494 


16 


.565 


2 


.583 


3 


.592 


5 


.553 


11 


.598 


12 


.487 


14 



Note; Items 1-6 are from Manipulative Measure 
Items 7-12 are from Overt Measure 
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TABLE XXXIII 



NONWRITTEN LOUDNESS ITEM ANALYSIS 
MAIN STUDY 
N = 214 





Item 


Per cent of students 
answering correctly 


Item- test 
correlation 


Original 
item number 



1 


.780 


.518 


5 


2 


.748 


.438 


6 


3 


.813 


.420 


11 


4 


.794 


.413 


12 


5 


.505 


.375 


17 


6 


.911 


.255 


18 


7 


.832 


.391 


1 


8 


.617 


.577 


4 


9 


.505 


.486 


8 


10 


.766 


.559 


10 


11 


.706 


.554 


13 


12 


.640 


.455 


17 



Note: Items 1-6 are from Manipulative Measure 

Items 7-12 are from Overt Measure 
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Item analyses for the Verbal and Listening Measures yielded better 
discrimination and difficulty indices than did analyses for the Manipulative 
and Overt Measures* In examining the item analyses for the written and non~ 
written measurements of the separate dimensions (Tables XXVIII to XXXIII)} it 
is evident that Written Pitch, Written Duration, and Written Loudness, accord- 
ing to the obtained indices, contain more effective items than Nonwritten 
Pitch, Nonwritten Duration, and Nonwritten Loudness. 



Correlations and Partial Correlations 



Table XXXIV presents a correlation matrix of the scores from the 
four measures, a standard intelligence test, and a standard reading achieve- 
ment test. The intercorrelations of the four measures are approximately the 
same as correlations of the four measures with IQ and with reading achievement. 
Among the intercorrelations of the four measures, the Listening Measure corre- 
lates the highest with each of the other three measures. Verbal and Listen- 
ing Measures correlate higher with IQ and reading scores than do Manipulative 
and Overt. 



Table XXXV presents an intercorrelation matrix of the written and 
nonwritten derived scores and IQ and reading scores. Various sized samples 
are presented in the matrix, since the school districts involved had used 
different IQ and reading tests; as is evident from the table, the correlations 
did not change appreciably from sample to sample. Each of the written derived 
scores for pitch, duration, and loudness correlates higher with reading scores 
than written derived scores correlate with nonwritten derived scores. The 
written derived scores for each dimension correlate higher with IQ and read- 
ing scores than the nonwritten derived scores correlate with IQ and reading 
scores . 



Table XXXVI presents correlations between the written and nonwritten 
derived scores, partial correlations holding out IQ, and partial correlations 
holding out reading scores. These partial correlations yield an estimate of 
the amount of correlation with the IQ or reading variable held constant. The 
difference between the partial and the original correlations suggests the 
amount of correlation due to the influence of reading or IQ. Significance of 
the partial correlations was computed using a Jt test (McNemar, 34). 

The partial correlations between Written and Nonwritten Pitch and 
Written and Nonwritten Duration were significant after either IQ or reading 
scores were partialled out, indicating that there was variance in these scores 
attributable to the concepts of pitch and duration. The partial correlation 
between Written and Nonwritten Loudness was not significant, indicating that 
the original correlation between the two measurements may have existed because 
of their relationship to either IQ or reading. This lack of relationship may 
be due in part to the low reliability of Nonwritten Loudness. 

The means and standard deviations for the IQ and reading scores 
appear in Table XXXVII. As indicated, an unrestricted range for the IQ and 
reading scores entered into the correlations. 
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*Values below .263 are nonsignificant at .05 level. 



TABLE XXXVII 



MEANS AND STANDARD DEVIATIONS FOR READING 
AND IQ SCORES, SELECTED SAMPLES 
MAIN STUDY 



Standard test 


Number of 
subjects 


Slean 


Standard 

deviation 


IQ 


57 


107.19 


13.87 


IQ 


94 


105.79 


13.16 


Reading 


57 


3.89 


1.01 


Reading 


69 


4.00 


.95 



Validity 



Content validity was investigated and established in the develop- 
ment of the Battery of Musical Concept Measures by consensus of experts. This 
is explained in the Development of the Measures (Chapter III), and is dis- 
cussed further in the discussion section of this chapter. 

Construct validity was investigated in the Main Study in relation 
to rigorous criteria set forth by Campbell and Flske (9), who propose evalu- 
ating a multitrait-multimethod matrix by four criteria; 

1. "The entries in the validity diagonal should be significantly 
different from zero and sufficiently large to encourage further examination 
of validity. This requirement is evidence of convergent validity" (p, 81), 

In Table XXXVIII the validity estimates for pitch, duration, and loudness are 
respectively ,481, .427, and .343, all significantly different from zero, 

2. "A validity diagonal value should be higher than the value lying 
in its column and row in the heterotrait-hetermethod triangles" (p. 81). In 
Table XXXVIII the validity estimates for pitch and duration, but not loudness, 
are higher than the correlations in the heterotrait-heteroinethod triangle* 

3. "A variable /shouj^/ correlate higher with an independent effort 
to measure the same trait than with measures designed to get at different 
traits which happen to employ the same method" (p, 82), In Table XXXVIII 
evidence of discriminant validity is given for Nonwritten Duration and Non- 
written Pitch, The highest correlations obtained are between Written Meas- 
ures, regardless of the concept being measured, showing the presence of method 
variance in the correlations, 

4. "The same pattern of trait interrelationship j^oujl^ be shown 
in all of the heterotrait triangles of both the monomethod and heteromethod 
blocks (p, 82), In Table XXXVIII this desideratum is met and provides evi- 
dence for discriminant validity. 



TABLE XXXVIII 



COBtRELATION MATRIX FOR THE WRITTEN AND NONWRITTEN MEASURES 
(MULTITRAIT -MULTIMETHOD MATRIX) 

MAIN STUDY 
N = 214 



Derived scores W.P. W.D. W.L. N.P. N.D. N.L. 



Written Pitch 
Written Duration 
Written Loudness 
Nonwrit ten Pitch 
Nonwrit ten Duration 
Nonwritten Loudness 



706 






620 


.689 




547 


.587 


.591 


481 


.412 


.394 


417 


.427 


.386 


394 


.310 


.343 



.652 

.398 .553 

.348 .326 .345 



Secondary Analysis of the Overt Measure 



Table XXXIX presents the percentages of oral responses for the Overt 
Measure. For each item, any one of the six responses (higher, lower, faster, 
slower, louder, softer) was possible since the items were open-ended with 
forced choice of one of the given responses. The obtained responses were 
tallied to investigate possible confusion of labels. For example, it had 
been expected that lower might be given as a response to softer music, and 
higher as a response to louder music, more often than other incorrect re- 
sponses. 



Table XXXIX shows that lower was the most frequent incorrect response 
to softer music, and higher was the most frequent incorrect response to louder 
music. When the music became faster , higher and louder were the incorrect 
responses equally chosen. When the music became slower , softer was the most 
frequent incorrect response, chosen by at least 26 per cent of the subjects. 

For items in which the musical change was higher , no consistent trend was 
found in the incorrect responses, except that lower and slower were the re- 
sponses least chosen. Since the items measuring a musical change of higher 
resulted in a wide variety of responses, implications from these items and 
the one lower item have not been drawni 

The Overt Measure contained nine items, each item having an overt 
response and an oral response. An estimate of the agreement between the oral 
and overt responses for each item was computed by Kuder -Richard son Formula 21, 
using as an agreement score for each subject the total number of times the 
subject had the same response, correct or incorrect, on both parts of an item 
(e.g., a score of 0 indicated no agreement on responses and a score of 3 in- 
dicated agreement on all response pairs for each dimension). As indicated in 
Table XL, the agreement between the overt and oral responses for each of the 
dimensions of pitch, duration, and loudness was low and nonsignificant. The 
means for the pitch, duration, and loudness agreement scores were approximately 
the same; this was also true for the standard deviations. There was no high 
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agreement between the overt and oral responses on any dimension. Children's 
movements to elements of music, as Investigated In this study, did not corre- 
late with what they expressed orally as having happened In the music. 



TABLE XXXIX 

ORAL RESPONSES GIVEN TO 
OVERT MEASURE ITEMS 
MAIN STUDY 
N - 214 



Response 

Item Correct 



number 


answer 


Higher 


Lower 


Louder 


Softer 


Faster 


Slower 


1 


Softer 


.08 


.15 


.03 


.62 


.01 


.10 


4 


Louder 


.13 


.00 


.77 


.01 


.10 


.00 


8 


Louder 


.14 


.04 


.64 


.02 


.12 


.02 


2 


Faster 


.06 


.004 


.06 


.02 


.85 


.004 


3 


Slower 


.00 


.11 


.01 


.37 


.01 


.50 


5 


Slower 


.06 


.12 


.03 


.26 


.03 


.49 


6 


Higher 


.37 


.08 


.24 


.07 


.16 


.07 


9 


Higher 


.25 


.08 


.07 


.21 


.28 


.09 


7 


Lower 


.06 


.45 


.08 


.11 


.20 


.09 



TABLE XL 

MEANS, STANDARD DEVIATIONS, AND KUDER-RICHARDSON 
ESTIMATES OF AGREEMENT FOR THE OVERT AND 
ORAL RESPONSES ON THE OVERT MEASURE 
MAIN STUDY 
N - 214 







Standard 


Estimate of 


Dimension 


Mean 


deviation 


agreement* 


Pitch 


1.640 


.868 


.023 


Duration 


1.804 


.885 


.123 


Loudness 


1.654 


.923 


.192 



*Kuder-Rlchardson Formula No. 21 



Stanines 



Nonnormalized stanine values were obtained for Written and Nonwrit- 
ten Pitch, Duration, and Loudness (Tables XLI to L). A stanine value was ob- 
tained from each possible score for each dimension on each measure. Since 
stanines are whole numbers by definition, each stanine value was rounded to 
the nearest whole number by the following procedure: 

If a value was less than or equal to X.50, this value was made equal 
to X. If an obtained value was greater than X.50, this value was 
made equal to X + 1. If a computed stanine was less than 1 or 
greater than 9, this value was made equal to 1 or 9, since stanines 
are, by definition, not less than 1 or greater than 9. 

The Verbal and Listening stanines were averaged to obtain a written 
score for each dimension. The Manipulative and Overt stanines were averaged 
to obtain a nonwritten score for each dimension. In averaging these stanines, 
any value equal to or greater than X.50 was made equal to X + 1 (e.g., 2_+_3j= 
2.5 = 3). ^ 



Discussion 



Reliability of Measures (Tables XX and XXI, pp. 50 5 2) 



One of the most important characteristics of a test is reliability.^^ 
"Reliability always refers to consistency throughout a series of measurements" 
(Cronbach, 11, p. 126). Reliability estimates can be obtained using statis- 
tics applied to the techniques of test-retest (stability estimate), parallel 
forms of the same test (equivalency estimate), or single administration 
test (internal consistency estimate). Each type of reliability yields a dif- 
ferent type of information, but each yields an estimate of the amount of test 
variance that is not error variance. 

Reliability for the Battery of Musical Concepts was investigated 
using an internal consistency estimate. A strata-fixed reliability estimate 
was chosen for the measurement of internal consistency, because the measures 
were heterogeneous and the strata fixed rather than random. 

The Listening Measure presents the highest overall reliability (.85) 
as well as the highest reliabilities for the three dimensions; in terms of 
reliability, it is the most satisfactory of the four measures. The reliabil- 
ity of the Verbal Measure (.71) is satisfactory for group measurement (Thorn- 
dike and Hagen, 52, p. 190). 

The overall reliability estimate for the Manipulative Measure (.66) 
is comparable to the estimate for the Overt Measure (.64). However, the 
strata reliabilities in the Manipulative Measure are more variable than the 
strata reliabilities in the Overt Measure. The low reliability estimate for 
Manipulative Loudness (.21), the lowest of the strata reliabilities, may be 
due in part to (1) the fine discriminations required, (2) the individual 
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TABLE XLI 



STANINE CONVERSION TABLE 
WRITTEN PITCH 



Number correct 
on 

Listening 

Pitch 




Number 


correct 


on 


Verbal Pitch 






0 


1 


2 


3 


4 


5 


6 


0 


1 


2 


3 


4 


4 


5 


6 


1 


2 


3 


3 


4 


5 


6 


6 


2 


2 


3 


4 


5 


5 


6 


7 


3 


3 


4 


4 


5 


6 


7 


7 


4 


3 


4 


5 


6 


6 


7 


8 


5 


4 


5 


5 


6 


7 


8 


8 


6 


4 


5 


6 


7 


7 


8 


9 


7 


5 


6 


6 


7 


8 


9 


9 


8 


5 


6 


7 


8 


8 


9 


9 



TABLE XLII 

STANINE CONVERSION TABLE 
NONWRITTEN PITCH 





Number correct 
on 

Overt 

Pitch 




Number 


correct 


on Manipulative 


Pitch 




0 


1 


2 


3 


4 


5 


6 


0 


2 


3 


4 


4 


5 


5 


6 


1 


3 


4 


5 


5 


6 


6 


7 


2 


4 


5 


5 


6 


6 


7 


7 


3 


5 


6 


6 


7 


7 


8 


8 


4 


5 


6 


7 


7 


8 


8 


9 


5 


6 


7 


7 


8 


8 


9 


9 


6 


6 


7 


7 


8 


8 


9 


9 



TABLE XLIII 



STANINE CONVERSION TABLE 
WRITTEN DURATION 



Number correct 
on 

Listening 

Duration 




Number 


correct 


on Verbal 


Duration 




0 


1 


2 


3 


4 


5 


6 


0 


1 


2 


2 


3 


4 


4 


5 


1 


1 


2 


2 


3 


4 


4 


5 


2 


1 


2 


2 


3 


4 


4 


5 


3 


2 


2 


3 


4 


4 


5 


5 


4 


3 


3 


4 


5 


5 


6 


6 


5 


3 


4 


4 


5 


6 


6 


7 


6 


4 


4 


5 


6 


6 


7 


7 


7 


4 


5 


5 


6 


7 


7 


8 


8 


5 


5 


6 


7 


7 


8 


8 



TABLE XLIV 

STANINE CONVERSION TABLE 
NONWRITTEN DURATION 



Number correct 
on 

Overt 

Duration 




Number 


correct 


on Manipulative 


Duration 




0 


1 


2 


3 


4 


5 


6 


0 


1 


1 


2 


2 


3 


4 


5 


1 


1 


1 


2 


2 


3 


4 


5 


2 


1 


1 


2 


2 


3 


4 


5 


3 


2 


2 


3 


3 


4 


5 


6 


4 


3 


3 


3 


4 


5 


5 


6 


5 


4 


4 


4 


5 


6 


6 


7 


6 


4 


4 


5 


5 


6 


7 


8 
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ERJC 



TABLE XLVII 





STANINE CONVERSION 
VERBAL MEASURE 


TABLE 








Dimension 




Number correct 


Pitch 


Duration 


Loudness 


0 


1 


1 


1 


1 


3 


2 


3 


2 


4 


3 


4 


3 


6 


5 


5 


4 


7 


6 


7 


5 


9 


7 


8 


6 


9 


8 


9 



TABLE XLVII I 

STANINE CONVERSION TABLE 
LISTENING MEASURE 



Number correct 


Dimension 


Pitch 


Duration 


Loudness 


0 


1 


1 


1 


1 


2 


1 


1 


2 


3 


1 


1 


3 


4 


2 


2 


4 


5 


4 


3 


5 


6 


5 


4 


6 


7 


6 


6 


7 


8 


7 


7 


8 


9 


8 


8 



76 




TABLE XLIX 





STANINE CONVERSION TABLE 
MANIPULATIVE MEASURE 


Number correct 


Dimension 

Pitch Duration Loudness 


0 


2 1 1 


1 


4 11 


2 


5 2 1 


3 


6 3 2 


4 


7 5 4 


5 


8 6 6 


6 


9 8 8 





TABLE L 

STANINE CONVERSION TABLE 
OVERT MEASURE 


Number correct 


Dimension 

Pitch Duration Loudness 


0 


2 1 1 


1 


4 1 1 


2 


5 12 


3 


7 3 3 


4 


8 4 5 


5 


9 6 6 


6 


9 7 8 



differences in manlpulatlcn of the devices, which resulted in a variety of 
acceptable answers, and (3) the large proportion of subjects making correct 
responses to the items, or a combination of these# 

In general, the Verbal and Listening Measures are superior to the 
Overt and Manipulative Measures, both in total reliability and in reliability 
for the three dimensions. 

The ratio of subjects by strata mean square to residual mean square 
is a test of the hypothesis that strata do not exist. All ratios for the 
three dimensions on the Written Measures ar 2 significant, indicating that the 
Verbal and Listening Measures are separate measures contributing different 
information to the written derived scores. The ratios for the Nonwritten 
Measures indicate that the Manipulative and Overt Measures contributed dif- 
ferent information to the nonwritten derived scores. This provides evidence 
that the Verbal, Listening, Manipulative, and Overt Measures are separate 
measures, measuring either different aspects of the musical concepts or dif- 
ferent concepts. The authors believe that these four modes of measurement 
reveal different aspects of the musical concepts of pitch, duration, and loud- 
ness . 



Correlations among strata are further indication of the relation- 
ships among the different modes of measurement. Generally, a high correla- 
tion may indicate that the strata are contributing similar information; a low 
correlation may indicate that the measures are different and are contributing 
unique information in that they are measuring different aspects of the same 
concepts, or that they contributed different information by measuring differ- 
ent concepts. Low reliability in separate measures also may contribute to low 
correlations. 

The correlations among strata, ranging from .322 to .426 for the 
Written Measures, suggest that the Verbal and Listening Measures are contri- 
buting different information to the written derived scores for the different 
dimensions. The correlations among strata for the Nonwritten Measures, rang- 
ing from .061 to .246 for the different dimensions, suggest that the Manipu- 
lative and Overt Measures contributed different information to the nonwritten 
derived scores. The authors believe that the low correlations among strata 
are evidence that the Verbal Measure is different from the Listening Measure 
in measuring concepts of pitch, duration, and loudness, and the Manipulative 
Measure is different from the Overt Measure in measuring these concepts. 



Item Analysis (Tables XXII to XXXIII, pp. 53 to 64) 

Item analyses are used primarily for selecting the best items for 
the final form of a test, also for improving validity and reliability. In 
general, items with item-test correlations between .30 and .80 present tests 
with satisfactory reliability and validity estimates, and difficulty indices 
of approximately .50 produce favorable reliability estimates (Guilford, 18, 
p. 471). 



Of the thirty-six items in the Verbal and Listening Measures (Tables 
XXII to XXV, pp. 53 to 56) , thirty are acceptable according to the stated 
criteria, contributing to the favorable reliability estimates of these 



78 



measures* It is assumed that future revision of the remaining six items, to 
conform with the criteria, would further increase the reliability. 

In the Manipulative Measure (Table XXVI, p. 57) items measuring the 
pitch dimension are more acceptable, according to the criteria, than items 
measuring duration or loudness. One reason for this is that most subjects 
made correct responses to the duration and loudness items. 

The Overt Measure item analysis (Table XXVII, p. 58) shows better 
indices for the oral responses than for the overt responses, with only three 
of the nine overt responses meeting the difficulty criterion. This is indi- 
cation that the oral responses constituted better items than the overt re- 
sponses . 



In summary, the written items for pitch, duration, and loudness 
(derived from the Verbal and Listening Measures) are generally superior to 
the nonwritten items (derived from the Manipulative and Overt Measures). 

This is also apparent in the item analyses for the derived scores for each 
dimension (Tables XXVIII to XXXIII, p. 59 to 64). The implication is that 
the Written Measures provided the more effective modes for identifying chil- 
dren's musical concepts. 



Correlations and Partial Correlations (Tables XXXIV to XXXVII, pp. 66 to 69) 



The present study investigated the correlation of the Battery of 
Musical Concept Measures with a meastjijre of intelligence, in accordance with 
the following statement of Campbell (8): 

A new test, no matter what its content, should be correlated with 
an intelligence test of as similar format as possible. . . . 

If correlations are reported with independent trait-appropriate 
or criterion measures, it should be demonstrated that the new 
test correlates better with these measures than does the intelli- 
gence test (p. 548). 

Correlations were computed for the written and nonwritten derived 
scores of each dimension with IQ scores (California Test of Mental Maturity) 
and reading achievement scores (California Achievement Test) for all subjects 
with these scores available. Because of the relatively small samples (fifty- 
seven for both scores, ninety-four for IQ, and sixty-nine for reading), conclu- 
sions drawn from this data must be considered tentative (Table XXXV, p. 67). 

The correlations for Written Pitch, and for Nonwritten Duration in 
one instance, meet Campbell's recommendation j the correlations for the other 
derived scores do not. This may be due in part to the higher reliability of 
the standardized IQ test entering into the correlations with the derived 
scores as opposed to the lower reliabilities of the two derived scores being 
correlated. The implication is that the measures should be further refined, 
with efforts concentrated on increasing their reliability and validity. All 
correlations of both IQ and reading scores with written derived scores are 
higher than correlations of IQ and reading scores with nonwritten derived 
scores. A factor of general mental ability seems to be reflected in all 
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correlations, since none is zero or negative* This tends to confirm the authors' 
opinion that measures of musical concepts which incorporate cognitive aspects 
of musical understanding almost inevitably will show a substantial correlation 
with intelligence. 

The correlation of reading scores with Overt Measure scores, while 
significant, is lower than reading score correlations with the other three 
measures, accounting for only a small amount of the variance. This suggests 
that performance on the Overt Measure is less dependent on reading ability 
than performance on the other measures; the Overt Measure, therefore, may be 
suitable for use with deficient readers. 

Table XXXVI (p. 68) shows partial correlations for the written and 
nonwrltten derived scores of the three?' dimensions holding out reading and IQ 
scores. The consistent decrease in the intercorrelations when either IQ or 
reading scores are partialled out suggests that IQ and reading achievement 
enter similarly into all of the intercorrelations, and the relationship is not 
unique to any particular correlation. Correlations between Written and Non- 
written Pitch, and between Written and Nonwrltten Duration, are significant 
after partialling out either reading or IQ scores; the partial correlation 
between Written and Nonwrltten Loudness did not reach significance. This 
suggests that a factor in addition to intelligence or reading achievement 
contributed to the variance in the pitch and duration scores. The authors 
believe this factor is the musical concept being measured and that concepts 
of pitch and duration are indeed accounting for some variance in these scores. 



Validity 



Validity, as Guilford (18) notes, is a "highly relative concept," 
its chief requisite being that, when eKhibited as the quality of a test, the 
test measure what it is supposed to measure. Validity, however, varies with 
the use to which the instrument possessing it is put; in Guilford's words, 
the question should not ba "Is this test valid?" but rather "Is it valid for 
what?" (p. 461) 

The experimental Battery of Musical Concept Measures described in 
the present study began with the idea of developing a technique for identify- 
ing children's concepts of pitch, duration, and loudness, or, to paraphrase 
Cronbach (11), with only a concept for which a testing instrument was desired. 
The interpretation of tests, Cronbach observes; 

... is built up very gradually, and probably is never complete. 

As knowledge develops, we arrive at a more complete listing of the 
influences that affect the test score, and may be able to estimate 
the strength and character of each influence. At present, the 
interpretation of even the best-established psychological tests 
falls far short of the ideal (pp. 120-1). 

Establishing the validity of this study's measures, therefore, has 
presented challenging problems, since the Battery represents an attempt to 
measure a cognitive aspect of an aural (musical) experience. 
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In general, four types of validity are discussed with reference to 
psychological testing: predictive, concurrent, construct, and content. Pre- 

LLive and concurrent validity are criterion oriented. Predictive validity 
is relevant to the establishment of a predictive function in terns of a cri- 
terion. Since the present investigators were chiefly interested in develop- 
ing a technique for identifying musical concepts, predictive validity was not 
investigated. Concurrent validity is usually established when the intent is 
to substitute one test, measure, or other type of judgment, for another. 

Since no a priori measure or criterion was available for the present Battery, 
concurrent validity was not attempted# 

Content validity was investigated during the development of the 
Measures by means of expert judgment of the items developed, the content of 
the Measures. Such judgment was first utilized as the research 

oped, considered, accepted, rejected, and tested ^^ent 

tice of the team continued throughout the development of the items. Content 
validity was also Investigated in terms of consensus of other music experts. 
Therefore, the items developed for the Battery of Musical Concept Measures 
were deemed valid for the measurement of pitch, duration, and loudness (cf. 

Chapter III) . 

Since it is generally accepted that construct validity should be in- 
vestigated when no criterion in the field is available (Cronbach Meehl , 12) , 
this was investigated using three different approaches: (1) 

IQ measures, (2) correlations with reading measures, and (3) the multitrait 
mJltSod matrix (Campbell and Fiske, 9). It should be noted that the rig- 
orous criteria proposed by these authors may not be entirely applicable to 
measurement of an essentially aural concept, and ^^sj^°tJieretofore been ap- 
plied to such a measurement. Nevertheless, it was thought Important to apply 
the criteria essentially as a means of identifying areas in which the meas- 
ures might be strengthened or refined, rather than as an assessment of present 
status. The discussion of construct validity, therefore, in reference to e 
nuUitrait-muUluethod matrix (Table XXXVIll, p. 70) will center on the four 
criteria of Campbell and Flake (cf. p. 69). This matrix preaenta reaulta in 
line with their "typical case," showing more method variance than trait vari 
ance. Campbell and Fiske comment that such results may seem more dlsappolnt- 
Ine than would validity data presented as Isolated values plucked from a 
vafldity diagonal, but they also state that these Isolated values present a 
deceptive picture of validity. The multltralt-multlmethod "-^^trlx presents 
data^ from which the experimenter may draw Implications ^ 

research, and for developing better tests rather than simply evaluating cur 

rent tests. 

All validity estimates in the matrix for Written and Nonwrltten 
Measures are significant, although certain of these could be Increased 
through test refinement. Various interpretations of the validity estimates 
are pLslble, but the authors tend to believe that the Written Measures, with 
their higher reliability estimates, functioned better than the Nonwrltten 
Measures in identifying children's musical concepts. 



Secondary Analysis of Overt Measure (Tables XXXIX and XL, p. 71) 



One of the expectations of the study was that children might evi- 
dence confusion in identifying the various musical elements. The Overt Meas- 
ure, with a free choice of alternatives for the oral response, provided data 
relevant to this expectation. Although the number of items was limited, some 
tentative conclusions may be drawn from an analysis of these data. 

1. When children were presented with a musical change softer , the 
most frequent incorrect response was lower . This appears to be a genuine con- 
fusion in labels, possibly resulting from many experiences with the use of 
lower , and related words such as down , as synonymous with softer (e.g., "Turn 
it down."). 



2. When the musical change was louder , the most frequent incorrect 
response was higher . It is speculated that this is a result of hearing in- 
crease in volume referred to as ^ or higher (e.g., "Turn it up."). 

3. When the music became slower , the most frequent incorrect re- 
sponse was softer . the second most frequent incorrect response was lower . 

4. When the music became faster , the most frequent incorrect re- 
sponses were higher and louder . 

The trends shown in these observations are consistent with expecta- 
tions that subjects would confuse high-loud-fast and low-soft-slow. 

An examination of the relationship between the physical movement 
response and the oral response to individual items in the Overt Measure showed 
low, positive, but nonsignificant agreement between the physical and oral 
responses for all dimeiTsions (Table XL, p. 71). These findings, although 
based on a small number of items, indicate that children’s movements in re- 
sponse to predominant changes in the music did not agree with their spoken 
judgments on what happened in the music. 
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CHAPTER V 



SUMMARY, CONCLUSIONS AND IMPLICATIONS, 
AND RECOMMENDATIONS FOR 
FUTURE RESEARCH 



Suiranary 



The Problem and Its Background 



The purpose of the study was to develop a technique for identifying 
elementary echool children's concepts of the elements of pitch, duration, and 
loudness. (Although timbre was accepted as a musical element, the study was 
limited to pitch, duration and loudness.) Four measures were developed to 
identify and measure concepts of the elements or dimensions named, through a 
multimodal technique, specifically, the Verbal, Listening, Manipulative, and 
Overt Measures. Past tests have involved areas of music aptitude, music 
achievement, and music appreciation, but little has been done in the area of 
musical concepts, essentially cognitive in nature. 

The significance of the present problem evolved from a background 
of publications and speeches devoting considerable attention to the structur- 
ally-based, concept-centered music curriculum. In part, ouch attention re- 
flects a general search for a more effective method of communicating know- 
ledgs; it also reflects an interest in the reeKamination of music as a sub- 
ject with a unique disciplinary structure. 

Although literature relevant to the music education of children re- 
peatedly cites the importance of musical concepts, the music curriculum in 
the elementary school in practice is still based largely on subjective expe- 
rience and judgment rather than empirical evidence. An apparent gap exists 
between the opinions of leading music educators regarding concept-centered 
curriculum content and the identification of a body of knowledge on children s 
concepts of musical elements (dimensions). A review of related research re- 
vealed few studies pertinent to the present investigation. Yet it is meaning- 
less to advocate the concept-centered music curriculum when there is little 
understanding of children's concepts of the basic musical elements. There- 
fore, the present study has attempted to develop a means of identifying such 

concepts. 



The multimodal technique developed in this study, referred to as 
the Battery of Musical Concept Measures, included two written group measures 
(Verbal Measure and Listening Measure) and two nonwritten individual measures 
(Manipulative Measure and Overt Measure) . Each of the measures employed a 
different stimulus-response combination to measure children's concepts of 
pitch, duration, and loudness. 
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Description of Battery of Musical Concept Measures 



The Verbal Measure (reliability = *71) consisted of eighteen 
multiple-choice items, six items measuring each of the dimensions. This 
measure involved comparison and discrimination of natural or music -related 
sounds recalled from experience. A minimal ability to read and comprehend, 
as well as a background of experience with natural and musical sounds, was 
required for the items in this measure. 

The final form of the Verbal Measure evolved from several test-and- 
revise cycles between July of 1965 and August of 1966. Approximately one 
hundred verbal items were devised, proposed, and evaluated by members of the 
research team. After considerable discussion of content and form, the five- 
answer multiple -choice format was adopted. 

Although the appropriateness of a verbal mode for measuring musical 
concepts, aural in nature, is debatable, the consensus of the authors is that 
such a mode can effectively reveal musical concepts. The measure has proved 
effective for group measurement. 

The Listening Measure (reliability = .85) was developed to measure 
the subject's ability to identify predominant changes in the dimensions of 
pitch, duration, and loudness within the multidimensional frame of reference 
of orchestral music. The final form of this measure consisted of eighteen 
short musical excerpts, four to twenty seconds in length, from standard or- 
chestral literature. These items were of three types; 

1. In the first group of six items, each contained a predominant 
change in one dimension, requiring a single judgment of the predominant change 
within the musical example, e.g., faster, lower, softer. 

2. In the sect nd group of six items, each contained predominant 
changes in two dimensions^ requiring a judgment of two changes within the 
example, e.g., higher and louder, faster and softer, lower and slower. 

3. In the third group of six items, each pair of musical excerpts 
differed predominantly in one dimension, requiring judgment of the predomi- 
nant manner in which the second excerpt differed from the first. 

In the development of the Listening Measure, a large number of or- 
chestral excerpts were identified as examples having definite change in a 
single dimension or in two dimensions. These excerpts were analyzed and dis- 
cussed as possible items. After numerous evaluation sessions, administration 
trials, and revisions on the basis of item analyses, the final form of the 
eighteen-item measure evolved. 

The Manipulative Measure (reliability = .66) was developed for use 
with individual pupils and provided a mode for demonstrating understanding of 
pitch, duration, and loudness concepts through manipulation of simple music- 
making devices (e.g., triangle, resonator bells, finger cymbal). Items were 
devised to measure these concepts, and promising items were approved by con- 
sensus of the research team. 
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After several field trials and revisions, the final form of the 
eighteen- item Manipulative Measure evolved with six items measuring each of 
the dimensions of pitch, duration, and loudness. Items were so designed that 
the manual dexterity required for performing each task was minimal. 

The Overt Measure (reliability » .64) was developed to provide in- 
dividual subjects with a mode for demonstrating understanding of pitch, dura- 
tion, and loudness concepts by responding with bodily movements and oral re- 
sponses to excerpts from orchestral literature. Musical examples thirteen to 
twenty-five seconds in length, each containing a predominant change in one of 
the musical dimensions being measured, were identified as items. 

The final form of the Overt Measure consisted of nine musical ex- 
cerpts, with one point each scored for overt and oral responses, a total of 
eighteen points. Each excerpt was played three times; subjects were instruc- 
ted to (1) listen to the excerpt, (2) move to the music, changing movement 
when the music changed, (3) listen again, refer to the card if necessary, and 
tell what change occurred. 

Written and Nonwritten Pitch, Duration, and Loudness scores were 
derived from the four measures. Reliability estimates computed from Main 
Study data for the derived scores were: Written Pitch, .73; Written Duration, 

.66; Written Loudness, .59; Nonwritten Pitch, .65; Nonwritten Duration, .55; 
Nonwritten Loudness, .35. 



Procedures 



Members of the research team proposed items for each section of the 
Battery of Musical Concept Measures. These items were reviewed, revised, and 
accepted or rejected before further validation of accepted items by a panel of 
judges. Several test-and-revise cycles prior to the Pilot Study involved 
trial administrations with various groups of subjects from the University 
Park area and two other Pennsylvania school districts. Items were examined 
according to item analysis criteria described in the study; items not func- 
tioning effectively were either revised or deleted. 

The Battery was administered in the Pilot Study to two classrooms 
(56 subjects) from the Washington County, Maryland, schools. On the basis of 
item analysis, further revisions were made before the Battery was administered 
to the Main Study sample. In the Main Study the Written Measures (Listening 
and Verbal) were administered to sixteen entire classrooms of fourth-grade 
pupils; the Nonwritten Measures (Manipulative and Overt) were administered 
individually to pupils randomly selected from these classrooms. The data ob- 
tained were statistically analyzed to yield reliability estimates, correla- 
tion matrices , item analyses, and other statistics. 

Reading achievement and IQ scores, for the segment of the sample 
with available data on California Mental Maturity and California Achievement 
Tests, were correlated with the scores from the Battery of Musical Concept 
Measures. Although correlations with the Written Measures were higher than 
with the Nonwritten Measures, all correlations were significant. 



Sub.jects 



Four hundred twenty-nine pupils from sixteen schools in twelve 
Pennsylvania school districts were tested on the group measures, in addition 
to the various trial-administration and Pilot Study samples. Subjects used 
in all trials totaled 601 elementary school children. From the sample of 429, 
214 were randomly selected as subjects for the individual measures. The 
twelve school districts represented a large geographical area and various 
socioeconomic strata. 



Conclusions and Implications 



The Battery of Musical Concept Measures, a pioneer effort in the 
particular area of musical measurement involved, represents the development 
of a technique for identifying children's concepts of pitch, duration, and 
loudness. The Battery has functioned with considerable effectiveness during 
the present study in identifying fourth-grade pupils' musical concepts. Be- 
fore the measures are acceptable for practical use, however, certain revi- 
sions and refinements should be made, and the measures standardized. At 
present, the measures are adequate for research purposes. 

The Written Measures (Listening and Verbal), with their high reli- 
abilities and efficiency of administration, show exceptional potential as 
practical measures. The Nonwritten Measures (Manipulative and Overt) are 
more time-consuming than the Written Measures in terms of the individual ad- 
ministration technique involved, and required trained personnel; nevertheless, 
these measures appear to possess particular possibilities for use with chil- 
dren having reading and/or language problems, or with younger children in the 
pre-reading stage. 

In a number of cases it appeared that the subject possessed the con- 
cept being measured but exhibited confusion regarding the appropriate label 
(terminology). This suggests a need for increased emphasis on teaching labels. 
A test-teach-retest procedure with the Battery should yield further informa- 
tion concerning this aspect of the study, and also should furnish additional 
evidence on the validity of the measures. 

In the development of the Battery of Musical Concept Measures, con- 
siderable effort was devoted to devising loudness and duration items of appro- 
priate difficulty. The Manipulative Measure posed particular problems re- 
lated to devising sufficiently difficult loudness items; it was necessary to 
approach a physiological level of discrimination to frame items with diffi- 
culty indices lower than .80. (It should be noted that high means or high 
difficulty indices reported on tests may indicate that the items devised were 
faulty, or that the subjects possessed the concept being measured.) Thv° 
means and difficulty indices for Nonwritten Duration and Loudness were quite 
high, manipulative loudness being highest even after numerous efforts to de- 
vise more difficult items. Williams (56) and Riley and McKee (42) have in- 
dicated from their observations that the concept of loudness seems to develop 
early; Riley and McKee particularly noted young children's difficulty in 
learning pitch discrimination. Petzold (39) concluded from his study that 
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children perceive rhythm more accurately than melody, with later performance 
plateaus occurring for pitch-related tasks. In summarizing eKperimental 
studies, Valentine (53) indicated that non-trained listeners notice rhythm 
more than melody or harmony. These conclusions support indications from the 
present study. The authors believe that most children possess a more highly 
developed concept of loudness, and possibly of duration, than of pitch, as 
expressed on the Manipulative and Overt Measures. 

In the Overt Measure, subjects evidenced change of overt movement 
in response to musical changes of duration and loudness; however, the major- 
ity of subjects failed to show such changes when pitch was the predominant 
change in the music. In correlating the overt responses with the subject's 
oral responses to the same musical stimuli, no significant relationship was 
found, indicating that subjects who could show the change might be unable to 
label the change, and that subjects who could orally indicate the change 
might not be able to demonstrate the change by overt movement. 

Although the data are not conclusive, it appears that some children 
confuse the three terms high, loud, and fast, and the three terms low, soft, 
and slow. This may be due to the frequent association in music of these groups 
of characteristics, or to a general confusion of labels. 

Most of the subjects appeared to enjoy participating in the study. 
Some children who were not selected from the classroom samples for the indi- 
vidual measurement expressed disappointment at being bypassed. Conversely, a 
few inhibited or self-conscious children seemed to find it difficult to move 
freely in the Overt Measure activity. Some of the apparent pupil enthusiasm 
may have resulted from the change in routine introduced by the testing, but a 
majority of the subjects evidently enjoyed the variety of musical experiences 
represented in the measures. 



Recommendations for Future Research 



Among the many truisms on the subject of research, there is one to 
the effect that "research begets research." The present study has suggested 
a number of avenues for future research. The authors propose to extend the 
study in the following ways, when funds are available: 

1. The present chapter refers to a test-teach-retest procedure to 
be undertaken for the purpose of construct validation. Application has been 
made for funds to cover this procedure. 

2. The dimension of timbre should be incorporated in both Written 
and Nonwritten Measures. This dimension was excluded from the present study 
after careful consideration. It is believed that this was a wise decision. 
The dimension is extremely difficult to place within a verbal mode, as a 
search of the literature describing instrumental and vocal timbre attests. 
There are few verbal descriptions of the tone quality of various instruments, 
and those that exist are traditional (in some cases almost whimsical) and 
certainly subjective. 
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3. The subject of children's growth in concept development (origi 
nally part of the present study in its unrevised form) should be explored 
through administering the measure to samples of subjects at different age or 
grade levels. (Grades 1, 3, and 5 had been noted in the original proposal 
prior to its revision.) 

4. The Written Measures should develop even higher reliability 
through revision of certain items and/or lengthening of the measures. How- 
ever, the authors believe that these measures, which in their present form 
may be administered in two thircy-minute sessions (time approximate), are 
near the tolerance limit of fourth-grade pupils' attention span. Item revi- 
sion, rather than lengthening of measures, seems to be the better approach. 

5. The Written Measures should be examined with the objective of 
adaptation for use with pre-school, first-grade, and second-grade pupils. 
This would make possible a longitudinal study of the development of musical 
concepts . 



6. The Nonwritten Measures are believed to possess potential for 
further development. The authors feel that these measures (Overt and Manipu- 
lative) may prove of particular value with children handicapped verbally or 
visually, children who have problems speaking or reading, and possibly, chil- 
dren with problems relating to emotions or attitudes. Even during the test- 
ing that was part of the measures' development, the research team tentatively 
identified a few children who appeared to have certain of the problems men- 
tioned. (These measures, however, when refined, should be effective with 
"normal" children also.) Of course, it is true that individual observation 
of a child, such as takes place during the administration of the Nonwritten 
Measures, is informative and revealing in itself. 

In addition to the preceding proposed research, studies are needed 
in the following areas: 

1. Studies are needed of children's reactions to various types of 
musical input constituting the daily media-derived environment as this affects 
concept development. 

2. Further exploration is needed of the present research team's 
observation that the pitch stimulus in itself does not cause overt reaction 
manifested as physical movement. 

3. Research in depth is indicated on the subject of how children 
acquire the observed semantic confusion as to musical terms or labels. 

4. Studies of musical concepts based on children's free verbal re- 
action to musical excerpts could be revealing as to descriptions of music 
made in the characteristic vocabulary of children. 

5. The study of children's overt reaction to musical stimulus for 
the purpose of demonstrating possession of musical concepts might be imple- 
mented by use of a videotaped or filmed record. 
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Appendix A; Statistical Tables from Prepllot Studies 



TABLE page 

LI Verbal Measure Item Analysis, Estella Trial 94 

LII Verbal Measure Item Analysis, Central Dauphin Trial 95 

LIII Verbal Measure Item Analysis, Central Dauphin and 

Estella Combined 96 

LIV Listening Items, Percentages Correct, University 

Groups Trials 97 

LV Listening Measure Item Analysis, University Area 

Trial 98 

LVI Listening Measure Item Analysis, Estella Trial 99 

LVII Means and Standard Deviations for the Listening 

Measure, Prepilot Trials 100 

LVIII Listening Measure Item Analysis, Central Dauphin and 

Estella Combined 101 

LIX Manipulative Measure Item Analysis, Estella Trial • 102 

LX Manipulative Measure Item Analysis, Bellefonte Trial 103 

LXI Overt Measure Item Analysis, Estella Trial 104 
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Verbal Measure, Form 1 



PICK THE 


BEST ANSWER 


1. 


As a 


melody goes up, the tones must become: 




a. 


softer 




b. 


louder 




c. 


higher 




d. 


lower 






faster 


2. 


As a 


melody goes down, the tones must become: 




a 9 


faster 




b. 


higher 




c. 


louder 




d. 


softer 




e. 


lower 


3. 


A short note could be: 




a* 


loud 




b. 


soft 




c* 


high 




d. 


low 




e. 


any of these 


4. 


When 


music gets louder we hear: 




a. 


more sound 




b. 


less sound 




c. 


higher sound 




d. 


lower sound 




e. 


slower sound 


5. 


Fast 


music is most like someone; 




a* 


walking 




b. 


running 




c. 


creeping 




d. 


hopping 




e. 


standing 


6. 


If you played a song faster and louder the second 




a * 


it would take less time than the first time 




b. 


it would take more time than the first time 




c. 


it would be higher than the first time 




d. 


it would be lower than the first time 




e. 


any of these 
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7. When music gets softer, we hear: 

a. higher sound 

b. lower sound 

c. faster sound 

d. less sound 

e. more sound 

8. Slow music is most like: 

a. running 

b. racing 

c. creeping 

d. galloping 

e. whirling 

9. High music is most like: 

a. thunder 

b. father's voice 

c. singing birds 

d. rumbling trucks 

e. roaring cannons 

10, Low music is most like: 



a. 


doorbell 






b. 


thunder 






c. 


chirping birds 




d. 


mother's voice 




e. 


telephone 






As a 


band in a 


parade gets near to you 


fl • 


the music 


seema 


louder 


b. 


the music 


seems 


softer 


c. 


the music 


seems 


higher 


d. 


the music 


seems 


lower 


e. 


the notes 


seem 


faster 



12. If you sing out of tune, this means: 

a. you sing too loudly 

b. you sing too softly 

c. you sing too quickly 

d. you sing ahead or behind the class 

e. you sing higher or lower than the right pitches 

13, Which instrument can sound several notes at the same time? 



a. drum 

b. piano 

c. trumpet 

d. flute 
e< voice 
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14. If you wrote a song for marching, it would be; 

a. faster than a lullaby 

b. slower than a lullaby 
C. soft and slow 

d. loud and slow 

e. high and slow 



15. A high note could be: 



a . loud 

b. soft 

c . long 

d . short 

e. any of these 



16. The opposite of a high tone is: 



a. loud tone 

b. soft tone 

c. low tone 

d . long tone 

e. short tone 



17. In marching we step to the; 

a. loudness of the music 

b. softness of the music 

c. the beat of the music 

d. the lowness of the music 

e. any of these 
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18. A loud note could be: 



a. 


high 








b. 


low 








c. 


long 








d. 


short 








e. 


any of 


these 






Pitch tells 


us : 






a. 


how soft the 


music 


is 


b. 


how fast the 


music 


is 


c. 


how high the 


music 


is 


d. 


how light the music 


is 


e. 


all of 


these 







20. If you played a song lower and louder the second time, 
it would: 



a. take more time than the first time 

b. take less time than the first time 

c. take more notes the second time 

d. have different notes the second time 

e. have the same notes the second time 



21. Two sounds of equal length must be: 



a. equally soft 

b. equally short 

c. equally loud 

d. equally high 

e. any of these 



22. If you were to sing a song slower and higher the second time, 



a. it would be softer 

b. it would be louder 

c. it would take more time to sing 

d. it would take less time to sing 

e. it would take the same amount of time 

23. Playing a song softer and lower the second time would mean: 



a. you would finish it sooner the second time 

b. it would sound like a different tune the second time 

c. there would be no change in the tune 

d. it would be faster the second time 

e. it would be slower the second time 
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Name 

Hagerstown 

Verbal Measure, Form 2 

Choose the best answer to complete each statement. Fill in the answer space 
before the answer you think is best . Choose only one answer for each number. 

Sample 

If you sing cut of tune, this means 
you sing too quickly, 
you sing too softly, 
you sing too loudly, 

you sing higher or lower than the right pitches, 
you sing ahead of or behind the class, 
band in a parade gets near to you, 
the music seems louder, 
the music seems higher, 
the music seems softer, 
the music seems lower, 
the notes seem faster, 

2, Low music is most like 
1 i a doorbell, 

1 I thunder, 

1 I chirping birds, 

] I mother's voice. 

I I the telephone. 

3, If you were to sing a song slower and higher the second time, 

be softer, 
be louder, 

take more time to sing, 
take less time to sing, 
take the same amount of time. 



1 I it would 
I I it would 
n it would 
I I it would 
I I it would 



□ 

□ 

□ 

□ 

□ 

1, As a 

□ 

□ 

□ 

□ 

□ 



2 



4. If you wrote a song for marching, it would be 



□ 


faster than a lullaby. 


1 ° 


slower than a lullaby. 


1 ° 


soft and slow. 


1 ° 


loud and slow. 


1 □ 


high and slow. 



5. A short note could be 



1 ° 


loud . 


1 ° 


soft. 


1 □ 


high. 


1 ° 


low. 


□ 


any of these. 



6. When music gets softer, we hear 



1 ° 


higher sound. 


□ 


lower sound. 


□ 


faster sound. 


□ 


less sound. 


□ 


more sound. 



7. If you played a song higher and softer the second time, it would 



□ 


sound bigger the second time. 


□ 


sound smaller the second time.. 


□ 


take less time the second time. 


□ 


take more time the second time. 



U use more notes the second time. 

Ill 

O 
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10 . 



11 . 



If you played a song faster and louder the second time, 
I I it would take less time than the first time. 

I I it would take more time than the fir<Jt time. 

I I it would be higher than the first time. 

I I it would be lower than the first time. 

I I any of these. 

The opposite of a high tone is a 
I I loud tone. 

I I soft tone. 

I I low tone. 

I I long tone. 

I I short tone. 

In marching, we step to the 
I I loudness of the music. 

I I softness of the music. 

I I the beat of the music. 

I I the lowness of the music. 

I I any of these. 

When music gets louder, we hear 
I I more sound. 

I I less sound. 

I I higher sound. 

I I lower sound. 

I I faster sound. 
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16. 


Slow 


music is most like 




□ 


running . 




□ 


skipping. 




□ 


creeping. 




□ 


galloping. 




□ 


whirling. 


17. 


VHiich instrument can sound several notes at the same time? 




□ 


drum 




□ 


piano 




□ 


trumpet 




□ 


flute 




□ 


voice 


18. 


A high note could be 




□ 


loud . 




□ 


soft . 




□ 


long. 




□ 


short . 




□ 


any of these. 
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Musical Sources of Items and Correct Responses 
Listening Measure, Form 1 



Musical Source Correct Response 



1. 


Sullivan: lolanthe. Overture 


louder 


2. 


Sullivan: Pinafore, Overture 


softer 


3. 


Sullivan: Pinafore, Overture 


slower 


4. 


Sullivan: Pinafore, Overture 


higher 


5. 


Bizet: Carmen, Habanera 


lower 


6. 


Lecuona: Malaguena 


faster 


7. 


Beethoven: Symphony No. 3, Third Movement 


louder 


8. 


Bach-Stokowski: Toccata in D Minor 


lower 


9. 


Tchaikovsky: Swan Lake, Czardas 


Easter 


10. 


Grofe: Grand Canyon Suite, Sunrise 


higher 


11. 


Handel: Water Music, Andante 


slower 


12. 


Handel: Fireworks, Lenteraent 


softer 


13. 


Beethoven: Symphony No. 1, Third Movement 


higher and louder 


14. 


Beethoven: Leonore Overture No, 2 


slower and lower 


15. 


Balfe: The Bohemian Girl, Overture 


slower and softer 


16. 


Beethoven: Leonore Overture No. 2 


higher and louder 


17. 


Beethoven: Leonore Overture No. 2 


higher and louder 


• 

00 
r— 1 


Brahms: Variations on a Theme by Haydn 


softer 


19. 


Sullivan: Pirates of Penzance, Overture 


faster 


20. 


Sullivan: Pirates of Penzance, Overture 


slower 


21. 


Beethoven: String Quartet No. 3, Third Movement 


higher 


22. 


Tchaikovsky: Swan Lake, Waltz 


louder 


23. 


Tchaikovsky: Swan Lake, Little Swans 


higher 


24. 


Beethoven: String Quartet No. 5, Third Movement 


lower 
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Answer Sheet, Listening Measure, Form 1 



1 . 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 

16. 

17. 

18. 

19. 

20 . 

21 . 

22 . 

I 

23. 

24. 
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Musical Sources of Items and Correct Responses 
Listening Measure, Form 2 





Musical Source 


Correct Response 


1. 


Sullivan: Pinafore, Overture 


softer 


2. 


Sullivan: Pinafore, Overture 


slower 


3. 


Sullivan: Pinafore, Overture 


higher 


4. 


Bizet: Carmen, Habanera 


lower 


5. 


Lecuona: Malaguena 


faster 


6. 


Beethoven: Symphony No. 3, Third Movement 


louder 


7. 


Bach-Stokowki: Toccata in D Minor 


lower 


8. 


Handel: Water Music, Andante 


s* lower 


9. 


Handel: Water Music, Lentement 


softer 


10. 


Beethoven: Leonore Overture No. 3 


lower 


11. 


Beethoven: Symphony No. 1, Third Movement 


higher and louder 


12. 


Tchaikovsky: Swan Lake, Czardas 


faster and softer 


13. 


Beethoven: Leonore Overture No. 2 


higher and louder 


14. 


Beethoven: Leonore Overture No. 2 


higher and louder 


15. 


Brahms: Variations on a Theme by Haydn 


softer 


16. 


Sullivan: Pirates of Penzance. Overture 


faster 


17. 


Sullivan: Pirates of Penzance. Overture 


slower 


18. 


Tchaikovsky: Swan I.nke, Waltz 


higher 


19. 


Tchaikovsky; Swan Lake, Little Swans 


higher 


20. 


Beethoven: String Quartet No. 5, Third Movement 


lower 
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Answer Sheet, Listening Measure, Form 2 

We will hear some short musical selections. Each one will change in 
some way. Listen to the complete example and decide what is the most 
important change. Choose the correct answer on your paper, but listen again 
as the same example is repeated before marking your answer. Fill in the 
square beside the answer that you think is right. 

Sample ^ 



□ 


faster 




□ 


faster 


□ 


slower 




□ 


slower 


□ 


louder 




□ 


louder 


□ 


softer 




□ 


higher 


□ 


higher 




□ 


lower 


□ 


faster 


5. 


□ 


faster 


□ 


slower 




□ 


slower 


□ 


louder 




□ 


softer 


□ 


softer 




□ 


higher 


□ 


lower 




□ 


lower 


□ 


faster 


6. 


□ 


faster 


□ 


slower 




□ 


slower 


□ 


louder 




□ 


lower 


□ 


higher 




□ 


louder 


□ 


lower 




□ 


softer 


□ 


faster 


7. 


□ 


louder 


□ 


slower 




□ 


softer 


□ 


louder 




□ 


lower 


□ 


higher 




□ 


higher 


□ 


lower 




□ 


faster 
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8 



12 



( 



9 . 



10 . 



11 . 



□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 

□ 



softer 

higher 

lower 

faster 

slower 

softer 

louder 

higher 

lower 

slower 

louder 

higher 

Icwer 

faster 

slower 

higher and slower 
higher and louder 
lower and softer 
lower and louder 
louder and faster 



□ slower and softer 
] I slower and louder 
Q faster and softer 

□ faster and louder 

□ higher and louder 

n higher and softer 

□ higher and louder 

0 lower and softer 

□ lower and louder 

□ lower and faster 

1 I higher and softer 

□ higher and louder 

□ higher and slower 
I \ lower and slower 

□ lower and faster 
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sroup o£ examples is different. This time yon must listen to two 

the first Pill*-' *“» the second example is most different from 

the firste Fill in the square beside the word which tITTi this difference. 
These examples will not be repeated. airrerence. 

Sample 



15. 



16. 



17. 



i [ louder 


18. 


□ 


faster 


i 1 softer 




□ 


slower 


1 1 higher 




□ 


softer 


1 1 slower 




□ 


louder 


1 1 faster 




□ 


lower 


r~1 louder 


19. 


□ 


lower 


1 f softer 




□ 


higher 


1 f faster 




□ 


faster 


1 } slower 




□ 


slower 


1 1 higher 




□ 


softer 


1 1 higher 


20. 


□ 


lower 


1 1 softer 




□ 


higher 


1 1 louder 




□ 


louder 


1 1 slower 




D 


faster 


1 1 faster 




D 


slower 


n higher 








1 1 softer 








1 1 louder 








r~l slower 








ri faster 
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Musical Sources of Items and Correct Responses 
Listening Measure, Form 3 



Musical Source 



Correct Response 



1. ' Sullivan; Pinafore , Overture 

2. Sullivan: Pinafore , Overture 

3. Sullivan; Pinafore , Overture 

4. Bizet; Carmen , Habanera 

5. Lecuona: Malaguena 

6. Beethoven: Symphony 3, Third Movement 

7. Bach- Stokowski; Toccata in D Minor 

8. Handel; Water Music , Andante 

9. Handel; Fireworks , Lentement 

10. Beethoven: Leonore Overture 2 

11. Bach-Stokowski; Toccata ^ D Mino ^ 

12. Holst: The Planets , Venus 

13. Beethoven; Symphony Third Movement 

14. Holst: The Planets , Jupiter 

15. Tchaikowsky; Swan Lake , Czardas 

16. Beethoven: Leonore Overture No. 2 

17. Brahms; Variations on a. Theme ^ ffayd n 

18. Brahms; Variations ,on a Theme ^ Hay^ 

19. Tchaikowsky: Swan Lake , Little Swans 

20. Tchaikowsky; Swan Lake , Waltz 

21. Sullivan: Pirates of Penzance, Overture 

22. Sullivan; Pirates of Penzance , Overture 

23. Brahms: Concerto for Violin and Cello 

24. Ravel: Ma Mere L'^, Pavanne 



softer 

slower 

higher 

lower 

faster 

louder 

lower 

slower 

softer 

lower 

louder and lower 
slower and softer 
higher and louder 
faster and softer 
faster and softer 
higher and louder 
lower 
faster 
higher 
louder 
faster 
slower 
lower 
softer 
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Answer Sheet, Listening 



Measure, Form 3 



.... .™ ..... “ 
some way Listen the co^lete^examp^^^^^ 1„ 

^7the“ ani^^a^ple IS repeated before ^rhlng ^our answer. Then 
djp uiic , onoMpr that vou thxnK %s rignu. 



^hp 



SAMPLE A 




r~] faster 


n faster 


1 1 slower 


j 1 slower 


j 1 louder 


Q louder 


j 1 softer 


j 1 higher 


j \ higher 


1 1 lower 


SAMPLE B 


4. 


1 1 faster 


1 \ faster 


Q slower 


1 1 slower 


1 \ louder 


\ 1 louder 


\ 1 higher 


1~1 higher 


\ 1 lower 


□ lower 

5- _ 


1. 

1 1 faster 


\ 1 faster 


\ \ slower 


\ I slower 


1 1 louder 


1 \ softer 


\ 1 softer 


1 i higher 


□ lower 


j \ lower 
6. 


2. ^ ^ 

1 1 faster 


j \ louder 


] 1 slower 


□ softer 


1 1 louder 


\ \ faster 


D higher 


j \ slower 


] 1 softer 


□ lower 



□ louder 

□ softer 
I 1 lower 
Q higher 

□ faster 



9. 



10 . 



□ softer 
Q higher 

□ lower 

I I faster 

□ slower 

□ softer 

□ louder 

□ higher 

□ lower 

□ slower 

□ louder 

I I higher 
Q lower 
I I faster 
□ slower 
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DIRECTIONS: 

Look at your paper beginning with Number 11. In this part of the test 
there are afte? each square. The music will do t«o things at once. 

It may get higher and slower or faster and louder. Be sure to choose the 
answer that has both parts right. 



11 . 

□ higher and slower 

□ higher and louder 

□ lower and softer 

□ lower and louder 

□ louder and faster 

12 . 



□ lower and softer 

□ lower and louder 
faster and softer 

□ slower and softer 

□ faster and louder 

n higher and slower 

□ higher and louder 

□ lower and softer 

□ softer and faster 

□ louder and faster 

14. „ 

I \ slower and softer 

□ slower and louder 

□ faster and softer 

□ faster and louder 
□ higher and louder 



I slower and softer 
I I slower and louder 
P~| faster and softer 
I I faster and louder 
pn higher and louder 
16. ^ 

LJ higher and softer 
I I higher and louder 
I I higher and slower 
j I lower and slower 
□ lower and faster 
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DIRECTIONS: 



- 3 - 



The final group of examples is different. This time you must listen to 
two different examples. Decide how the second example is most different from 
the first. Fill in the square beside the word which tells this difference. 
Listen carefully the first time, because these examples will not be repeated. 



SAMPLE 


c 


19. 




23. 




□ 


faster 


□ 


lower 


□ 


lower 


□ 


slower 


□ 


higher 


□ 


higher 


□ 


higher 


□ 


slower 


□ 


faster 


□ 


lower 


□ 


faster 


□ 


slower 


□ 


louder 


□ 


softer 


□ 


softer 


SAMPLE 


D 


20. 




24. 




□ 


louder 


□ 


louder 


□ 


lower 


□ 


softer 


□ 


slower 


□ 


higher 


□ 


higher 


□ 


faster 


□ 


faster 


□ 


slower 


□ 


higher 


D 


louder 


□ 


faster 


□ 


lower 


□ 


softer 


□ 


higher 


21. 

□ 


higher 






□ 


lower 


□ 


softer 






□ 


softer 


□ 


louder 






□ 


faster 


□ 


slower 






□ 


slower 


□ 


faster 






18. 

□ 


lower 


22. 

□ 


higher 






□ 


louder 


□ 


softer 






□ 


softer 


□ 


louder 






□ 


faster 


□ 


slower 






□ 


slower 


□ 


faster 
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Musical Sources of Items and Correct Responses 
Listening Measure, Form 4 



Musical Source 



Correct Response 



Sample A 

Sullivan; lolanthe . Overture 
Sample B 

Beethoven; Leonore Overture NOc 2 

1. Sullivan; Pinafore . Overture 

2. Prokofiev; Classical Symphony . First Movement 

3. Bach- Stokowski; Toccata in D Minor 

4. Beethoven; Symphony No . _3> Third Movement 

5. Handel; Water Music . Hornpipe 

6. Lecuona; Malaguena 

Sample C 

Holst; The Planets . Jupiter 

7. Stravinsky; Petrouchka . Russian Dance 

8. Beethoven; Leonore Overture No . 2 

9. Tchaikowsky; Swan Lake , Czardas 

10. Prokofiev; Lieutenant Ki.ie , Kije's Wedding 

11. Beethoven; Consecration of the House 

12. Lizst; Grand Galop Chroma tigue 

Sample D 

Holst; The Planets . Jupiter 
Sample E 

Brahms ; Variations on a Theme Haydn 

13 . Brahms ; Variations on _a Theme by Haydn 

14. Tchaikowsky: Swan Lake . Little Swans 

15. Tchaikowsky; Swan Lake . Waltz 

16. Glinka: Russian and Ludmilla , Overture 

17. Ravel: Ma Mere I'Oye, Pavanne 

18. Tchaikowsky, Capriccio Italien 



louder 

lower 

softer 

higher 

lower 

louder 

slower 

faster 

softer and faster 
softer and slower 
higher and louder 
softer and faster 
slower and lower 
higher and faster 
louder and lower 

lower 

softer 

faster 

higher 

louder 

lower 

softer 

slower 
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Answer Sheet, Listening Measure, Form 4 



DIRECTIONS : 



We will hear some short musical examples. Each one will change in some 
way. Listen to the complete example and decide on the most Important change. 
Read the answers on your paper, but before marking your answer listen again 
as the same example is repeated. After hearing the example the second time, 
fill in the square beside the answer you think is right. Be sure you fill in 
only one answer space for each example. 



SAMPLE A 

I I faster 

I I slower 

[3] louder 

I I softer 

I I higher 

SAMPLE B 

I I louder 

□ higher 

D 

I I faster 
I I slower 

1 . 

I I faster 
I I slower 
I I louder 
I I softer 
I I lower 

2 . 

□ faster 
j I s lower 

□ louder 

D higher 
pi lower 



I I louder 
I [ softer 

□ lower 
I I higher 
( I faster 

4. 

□ louder 
□ softer 
I I faster 
I I slower 
I [ lower 
5. 

I I softer 

□ higher 
I [ lower 
I [ faster 
P] slower 

6 . 

I [ faster 
□ slower 
□ softer 
n higher 
P] lower 
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DIRECTIONS; 

In this part there are two answers after each square. Listen as the 
music changes in two ways. For example, the music may get higher and slower- 
or faster and louder. Be sure to choose the answer that has both parts right 
but fill in only one answer space for each example. 



SAMPLE C 
□ 


slower 


and 


softer 


10. 


□ 


higher and softer 


□ 


slower 


and 


louder 




□ 


higher and louder 


□ 


faster 


and 


softer 




□ 


higher and slower 


□ 


faster 


and 


louder 




□ 


lower and slower 


□ 


higher 


and 


louder 




□ 


lower and faster 


7. 

□ 


slower 


and 


softer 


11. 


□ 


lower and louder 


□ 


slower 


and 


louder 




□ 


lower and faster 


□ 


faster 


and 


softer 




□ 


higher and faster 


□ 


faster 


and 


louder 




□ 


higher and slower 


□ 


higher 


and 


louder 




□ 


louder and slower 


8. 

□ 


higher 


and 


softer 


12. 


□ 


lower and softer 


□ 


higher 


and 


louder 




□ 


lower and louder 


□ 


higher 


and 


slower 




□ 


higher and softer 


□ 


lower 


and 


louder 




□ 


slower and softer 


□ 


lower 


and 


fastei* 




□ 


higher and louder 



9. 

I I slower and softer 

□ slower and louder 
I I faster and softer 

□ faster and louder 
I I higher and louder 




127 



DIRECTIONS; 



In this group each example will have two different parts. Decide how the 
second part is most different from the first. Fill in the square beside the 
word which best describes this difference. Fill in only one answer space for 
each example. Listen carefully the first time because these examples will not 
be repeated. 



SAMPLE D 

□ 


faster 


15 . 


□ 


louder 


□ 


slower 




□ 


slower 


□ 


higher 




□ 


faster 


□ 


lower 




□ 


higher 


□ 


louder 




□ 


lower 


SAMPLE E 
□ 


louder 


16 . 


□ 


lower 


□ 


softer 




□ 


higher 


□ 


higher 




□ 


faster 


□ 


slower 




□ 


slower 


□ 


faster 




□ 


softer 


13 . 

□ 


lower 


17 . 


□ 


lower 


□ 


louder 




□ 


higher 


□ 


softer 




□ 


faster 


□ 


faster 




□ 


louder 


□ 


slower 




□ 


softer 


14 . 

n 


lower 


18 . 


□ 


lower 


n 


higher 




□ 


faster 


n 


slower 




□ 


slower 


n 


faster 




□ 


louder 


□ 


softer 




□ 


softer 
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Directions for Administration, Listening Measure, Pilot Study 

Distribute answer sheets. Make sure that each child has pencil with eraser. 
Instruct children to write their names in the space provided on page 1. 



START TAPE 



*"At the top of your answer sheet there are some directions. Read 
them silently to yourself while I read them to you. /"We will hear 
some short musical eKamples. Each one will change in some way. 
Listen to the complete example and decide on the most important 
change. Read the answers on your paper, but before marking your 
answer, listen again as the same example is repeated. After hearing 
the example the second time, fill in the square beside the answer 
you think is right. Be sure you fill in only one answer space for 
each example."/ 

"Let's try a sample. Listen to Sample A and hear how it changes." 
(Sample A) 

"Look at your answer sheet and decide which answer best describes 
how the music changed. (Pause) Listen again before marking your 
answer to see whether you were correct," 

(Sample A repeated) 

"Mark your answer for Sample A," 

STOP TAPE after "Sample A." 



ASK; "Which word describes the most important change?" 

ALLOW CHILDREN TO ANSWER (Correct answer is "louder"). 

SAY: Fill in the space beside "louder." If you had a different answer, erase 

Its’* 

WHEN ALL HAVE CORRECTED SAMPLE A, SAY "Let's try another sample." 



START TAPE 



"Listen to Sample B." 

(Sample B) 

"Decide what you think is the biggest change. Look for that answer 
on the sheet, but listen again before marking your answer." 

(Sample B repeated) 



^Indented items are on the tape. 
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"Mark your answer for Sample B." 



STOP TAPE after "Sample B." 



AFTER CHILDREN HAVE MARKED ANSWERS, ASK, "What did you m^rk?" AFTER CHILDREN 
HAVE ANSWERED, "Lower is right." "Are there any questions about how to do 
this?" ANSWER ANY QUESTIONS ABOUT PROCEDURE, THEN SAY, "Let's begin with 
Number 1. 



START TAPE 



Items 1 to 6 

"Turn to the directions at the top of page 2." 

(Pause) /^n this part there are two answers after each square. 
Listen as the music changes in two ways. For example, the music may 
get higher and slower--or faster and louder. Be sure to choose the 
answer that has both parts right, but fill in only one answer space 
for each exampl e."/ 

"Listen to a sample." 

(Sample C) 

"Decide the two most important ways the music changed and find the 
answer with both parts correct; but listen again before marking the 
answer." 

(Sample C repeated) 

"Mark your answer for Sample C." 

STOP TAPE after "Sample C." 

AFTER CHILDREN HAVE MARKED ANSWERS, ASK "What answer did you mark?" ALLOW 
CHILDREN TO ANSWER. (Correct answer is "faster and softer.") SAY "Be sure 
you have filled in the one space before the two words, faster and softer. 
Choose the one square with both correct answers, not two squares. Are there 
any questions?" 



START TAPE after any questions have been answered. 

Items 7 to 12 

"Turn to the directions at the top of page 3." 

(Pause) this group each example will have two different parts. 

Decide how the second part is most different from the first. Fill 
in the square beside the word which best describes this difference. 
Fill in only one answer space for each example. Listen carefully 
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the first time because these examples will not be repeated^/ Let's 
try a sample. Fill in the answer space beside the word which tells 
how the second part was different from the first part." 

(Sample D--two parts) 

STOP TAPE after "Sample D." 

ASK "What is the correct answer?" GET THE CORRECT ANSWER (lower) FROM THE 
CLASS. AFTER CHILDREN HAVE HAD TIME TO CORRECT THE ANSWER, "Let's try another 
sample. " 



START TAPE 



(Sample E) 

"Mark your answer for Sample E." 
STOP TAPE after "Sample E." 



GIVE CHILDREN TIME TO MARK AN ANSWER. ASK, "What answer do you have?" AFTER 
CHILDREN ANSWER (Correct answer is softer), ASK "Are there any questions?" 
AFTER ANSWERING QUESTIONS, "Let's go ahead with Number 13. Remember, there 
are two different parts for each example and these are played only once." 



START TAPE 



Items 13 to 18. 

STOP TAPE at the end. of Number 18, second part. 



GIVE CHILDREN TIME TO COMPLETE THE ANSWER. ASK THEM TO FOLD PAGE BACK SO THAT 
THEIR NAMES ARE ON THE TOP SHEET. COLLECT THE PAPERS. 
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Answer Sheet, Listening and Overt Measures, 
Final Forms, Preliminary Validation 



Sample A Overt Sample 

Sample B 1. 



1 . 



2 . 



2 . 



3 . 



3 . 



4 . 



4 . 5 . 

5 . 6 . 

6 . 7 . 

Sample C 8. 

7 . 9 . 



8 . 



9 . 



10 . 

11 . 



12 . 



Sample D 
Sample E 

13 . _ 

14 . _ 

15 . _ 

16 . _ 

17 . _ 

18 . 
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Manipulative Measure, Form 1 



1. Play 2 bells - which is higher 

2. Play 3 bells - which is the middle pitch 

3. Play 5 bells - arrange in order, play high to low 

4. Play 4 glasses - arrange in order, play low to high 

Experiment with bongos (play soft and loud) 

5. Play higher drum soft 

6. Play lower drum loud 

7. Play drum fast and get slower 

8. Play drum fast and soft 

9. Play drum slow and loud 

Experiment with orchestra bells (show loud and soft) 

10. Play loudly a scale getting higher 

11. Play softly a scale getting lower 

Experiment with chord organ 

12. Play short, long, and medium notes 

13. Using black buttons, play the chord which is highest, play lowest 

14. Using the 3 wood blocks, play the highest 

15. Play 3 finger cymbals in order from low to high 
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Manipulative Measure, Form 2 



NAME 



ITEM 


RESPONSE 
Right Wrong 


TESTING 


DRUM 

1. Play fast and softly 






L 


2. Play loudly and slowly 






L 


3. Play loudly and get softer 






L 


4. Play softly and get louder 






L 


5. Play fast and get slower 






D 


6. Play slowly and get faster 






D 


WATER GLASSES 

7. Play from low to high 






P 


UKULELE 

8. Play from high to low 






P 


CHORD ORGAN 

9. Using black buttons, find 
highest chord 






P 


10. Using black buttons, find 
lowest chord 






P 


11. Play some short notes, then 
gradually get longer 






D 


12. Play long notes, then 
gradually get shorter 






1 

D 


SIX RESONATOR BELLS 

13. Find highest pitch 






P 


14. Arrange in order and play 
from low to high 






P 


15. Play fast from low to high* 






D 


16. Play slowly from high to low* 






D 


17. Play loudly from low to high* 






L 


18, Play softly from high to low* 






L 



*Do not consider correctness of pitch on these items. 
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Manipulative Measure, Form 3 



Name 



TESTING 


ITEM 


RIGHT 


WRONG 


D 


DRUM 

1. Begin playing fast and gradually get slower. 






D 


2. Begin playing slowly and gradually get faster. 






L 


3. Play softly, loudly, medium. 






D 


CHORD ORGAN 

4. Begin playing short notes and play each one a little longer. 






D 


5. Begin playing long notes, and as you play have each note get 
a little shorter. 






P 


6. Using black buttons - find the lowest sounding chord. 






P 


7. Using black buttons - find the highest sounding chord. 






P 


RESONATOR BARS (4) 

8. Find highest bar. 






P 


9. Arrange the bars in order and play from lowest to highest. 






L 


10. Start at one end softly and make each one louder. 






L 


11. Three mallets on wood block: Which one is loudest? 






P 


12. Ukulele: Play strings in order from lowest to highest. 






L 


13. Tape recorder: Find the softest position. 






D 


14. Metal Square, finger cymbal, resonator bell: Which one 

sounds for the shortest time? 






L 


15. 3 Jingle Bells: Which set is loudest? 






P 


16. 4 Metal Plates: Play from highest to lowest? 






L 


17. 3 Maracas: Which is softest? 






D 


18. Drum, triangle, resonator bells: Which sound is medium in 

length? 
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Manipulative Measure, 



Form 4 



Name 



TESTING 




ITEM 


RIGHT 


WRONG 


D 


1. 


On drum: Begin playing fast and gradually get slower. 






D 


2. 


On drum: Begin playing slowly and gradually get faster. 






P 


3. 


On chord organ: Using black buttons, find the lowest 

sounding chord . 






P 


4. 


Same: Find the highest sounding chord. 






P 


5. 


Resonator bars (4): Find the highest sound. 






P 


6. 


Resonator bars: Arrange in order and play from the lowest 

to the highest. 






L 


7. 


Three mallets on woodblock: Play from loudest to softest. 






P 


8. 


Ukulele: Play strings in order from lowest sounding to 

highest sounding. 






L 


9. 


Tape recorder: Find the soft, medium, and loudest 

positions. 






D 


10. 


Metal square, finger cymbal, resonator bell (F) : Which 

one sounds for the shortest time? 






L 


11. 


Three Jingle bells: Play in order from softest to 

loudest. 






P 


12. 


Four metal plates: Play from highest to lowest (child 

may rearrange these if it helps). 






L 


13. 


Three maracas: Play in order from loudest to softest. 






D 


14. 


Drum, triangle, resonator bells: Which sound is medium 

in length? 






L 


15. 


Metal square, finger cymbal, triangle: Play in order 

from soft to 3,oud. 






D 


16. 


Copper pipes (7, 6, 2): Which pipe sounds for the 

longest time? 






D 


17. 


Metal plates (three, excluding the loudest): Which have 

the longest and shortest sounds? 






L 


18. 


Metal square, woodblock, horseshoe, triangle, and 
resonator bell: Choose the three loudest and play in 

order of loudness from loud to soft. 
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Musical Sources of Items and Dimension Measured 
Overt Measure, Form 1 



Musical Source 



Dimension Measured 



1. Handel: Royal Fireworks Music . La Rejoissance 

2. Handel: vvfater Music , Passepied 

3. Ponchielli: "La Gioconda . Dance of the Hours 

4. Herold: Zampa . Overture 

5. Herold: Zampa , Overture 

6. Tchaikowsky: Marche Slav 

7. Von Suppe; Poet and Peasant, Overture 

8. Saint-Saens: Carnival of the Animals , Fossils 

9. Saint-Saens: Carnival of the Animals , Elephants 

10. Stravinsky; Petrouchka, Russian Dance 

11. Prokofiev: Lieutenant Ki1e . Kije's Wedding 

12. Kabalevsky: The Comedians , March 

13. Villa-Lobos; The Little Train of the Calplra 

14. Khachaturian; Masquerade Suite . Galop 

15. Sullivan; lolanthe . Overture 

16. Sullivan; Pinafore , Overture 

17. Sullivan; Pinafore, Overture 



loudness 

loudness 

duration 

loudness 

duration and loudness 
loudness 
duration 
loudness 
duration 
duration 
pitch 
loudness 
duration 

duration and loudness 
loudness 
pitch 
duration 
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Answer Sheet, Overt Measure, Form 1 





Coded ResDonse (Check any observed Re 


sponse) 


Name 


Item 

No. 


Large to Small 
Movements 


Small to Large 


High to Low 


Low to High 


Fast to Slow 


Slow to Fast 


Smooth to Jerky. 


Jerky to Smooth 


Even to Uneven 


Uneven to Even 


(Last Name First) 

UNCODED RESPONSE OR 
GENERAL COMMENT 
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Musical Sources of Items and Dimension Measured 
Overt Measure, Form 2 





Musical Source 


Dimension Measured 


1. 


Tchalkowskv; Marche Slav 


loudness 


2. 


Von Suppe; Poet and Peasant, Overture 


duration 


3. 


Salnt-Saens; Carnival of the Animals, Elephants 


duration 


4. 


Stravinsky; Petrouchka, Russian Dance 


duration 


5. 


Prokflev; Lieutenant .Klje, Klje's Wedding 


pitch 


6. 


Kabalevsky; The Comedians , March 


loudness 


7. 


Villa-Lobos; The Little Train of the Calplra 


duration 


8. 


Khachaturian; Masquerade Suite, Galop 


duratloh and loudness 


9. 


Sullivan: lolanthe. Overture 


loudness 


10. 


Sullivan; Pinafore, Overture 


pitch 


11. 


Sullivan; Pinafore, Overture 


duration 



Answer Sheet, Overt Measure, Form 2 





1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


No response 
























Larger 
























Smaller 
























Higher 
























Lower 
























Faster 
























Slower 
























Smoother 
























Less Smooth 
























Description o£ 
Uncoded changes 
or 

General remarks 
























Is change Yes 

made at 

proper time? No 














































Is change Yes 

appropriate 

to music? No 














































Changes in music 


softe; 

1 


c £ast( 
2 


ar 

stac 

cato 

to 

legat 

3 


slowi 

0 

4 


er 

smoo 

lowi 

5 


loud( 

ther 

er 

6 


er fas 
7 


ter 

slowi 

fast' 

loud 

8 


loud< 

er 

er 

er 

9 


er low 
hig 
low 
higl 

10 


slower 

11 



er|c 
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Musical Sources of Items and Dimension Measured 
Overt Measure, Form 3 



Musical Source 



Dimension Measured 



1. Tchaikowsky: Marche Slav 

2. Von Suppe: Poet and Peasant . Overture 

3. Stravinsky: Petrouchka . Russian Dance 

4. Beethoven: Symphony No . 3 , Third Movement 

5. Sullivan: Pinafore , Overture 

6. Tchaikowsky: Overture 1812 

7. Moussorgsky-Ravel: Pictures at an Exhibition 

8. Mozart; The Marriage of Figaro . Overture 

9. Wagner: Die Meistersinger . Prelude to Act I 
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Musical Sources of Items and Correct Responses 
Overt Measure, Form 4 




Musical Source 



Correct Response 



1. Tchaikovsky: March Slav 

2. Von Suppe: Poet and Peasant , Overture 

3. Stravinsky: Petrouchka , Russian Dance 

4. Beethove^: Symphony No . 3 ^, Third Movement 

5. Sullivan: Pinafore , Overture 

6. Glinka: Russian and Ludmilla , Overture 

7. Liszt: Hungarian Rhapsody No . 2 

8. Mozart: The Marriage of Figaro , Overture 

9. Bartok: Concerto for Orchestra, Second Movement 



softer 

faster 

slower 

louder 

slower 

higher 

lower 

louder 

higher 



(For Answer Sheet see Appendix C, p.l63.) 
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Verbal Measure 



Name _ 
School 
Date 



Choose the best answer to complete each statement. Fill in the square beside 
the answer you think is best. Choose only one answer for each number. 

Sample 

If you sing out of tune, this means 

□ you sing too fast. 

□ you sing too softly. 

□ you sing too loudly. 

□ you sing higher or lower than the right notes. 

□ you sing faster or slower than the class. 



1. As a band in a parade gets nearer to you, 

\ I the music seems louder. 

I I the music seems higher. 
i I the music seems louder and faster. 

I I the music seems higher and faster. 

I I the music seems lower and slower. 

2. If you arranged five tones in order from highest to lowest, the lowest 
tone would be 

louder than the highest. 

shorter than the highest. 

different from the highest. 

the same as the highest. 

softer than the highest. 



□ 

□ 

□ 

□ 

□ 
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3. If you sang a song slower and higher the second time, 

□ it would be softer, 

□ it would be louder. 

□ it would take less time. 

I I it would take more time. 

I I it would take the same amount of time. 

4. A song for marching would be 

□ faster than a lullaby. 

□ slower than a lullaby. 

□ soft and slow. 

□ loud and slow. 

I I high and slow. 

5. A short note could be 

I I loud . 

□ soft. 
n high. 

I I fast. 

□ any of these. 

6. When music gets softer we hear 

□ higher sound, 

□ lower sound. 

□ slower sound, 

□ less sound. 

□ longer sound. 



146 



3 



7. If you played a song higher and softer the second time, 

I I it would sound bigger. 

□ it would sound smaller. 

I I it would take less time. 

□ it would take more time. 

I I it would use more notes. 

8. If you sang a song faster and louder the second time, 

I I it would take less time. 

I I it would take more time. 

I I it would be higher. 

I I it would be lower. 

I I any of these. 

9. Pick the best body movement to show someone that music gets lower. 

clapping 
r~] stooping 
I i stop moving 

□ stretching 

□ stepping 

10. In marching we step to the 

□ loudness of the music. 

□ tune of the music. 

I I lowness of the music. 

} ( beat of the music. 

□ any of these. 
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11. When music gets louder, 

I I we hear more sound. 

I I we hear less sound. 

I I we hear higher sound. 

I I we hear lower sound. 

I I we hear faster sound. 

12. A loud note could be 

D high. 

□ low. 
n long. 

I I short. 

I I any of these. 

13. As a melody goes up, the tones must become 

I i softer. 

□ louder. 

□ higher. 

I I lower. 

I I faster. 

14. If you played a song louder and lower the second time, 

I I you would finish it sooner. 

I I there would be less sound. 
r*] there would be more sound. 

I I it would be faster. 

I I it would be slower. 
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15. High music is most like 

I I thunder. 
i I singing birds. 

1 1 a cow’s moo. 

1 I a rumbling truck. 

□ roaring cannons. 

16. Slow music is most like 

I 1 running . 
j I skipping. 

□ creeping. 

□ galloping. 

I I spinning. 

17. Which of these instruments can sound several notes at the same time? 

i I drum 
D piano 
\ I trumpet 
I I flute 

D voice 

18. A high note could be 

□ loud . 

0 soft. 

□ long. 

1 I short. 

□ any of these. 
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Difficult Words, Verbal Measure 



arranged 

movement 

instruments 

me lody 

lullaby 

stretching 

music 
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Musical Sources of Items and Correct Responses 
Listening Measure 






Musical Source 



Correct Response 



Sample A 

Sullivan; lolanthe . Overture 
Sample B 

Beethoven: Leonore Overture No . 2 

1. Sullivan; Pinafore . Overture 

2. Prokofiev: Classical Symphony . Movement 1 

3. Bach; Toccata in D Minor 

4. Beethoven: Symphony No . 3, Movement 3 

5. Handel: Water Music . Hornpipe 

6. Lecuona: Malaguena 

Sample C 

Beethoven; Egmont Overture 

7. Stravinsky: Petrouchka . Russian Dance 

8. Beethoven; Leonore Overture No . 2 

9. Tchaikowsky: Swan Lake . Czardas 

10. Prokofiev; Lieutenant Kije . Kije's Wedding 

11. Beethoven: Consecration of the House 

12. Lizst; Grande Galop Chromatidue 

Sample D 

Holst; The Planets . Jupiter 
Sample E 

Brahms : Variations on £ Theme by Haydn 

13. Brahms: Variations on a Theme by Haydn 

14. Tchaikowsky; Swan Lake . Little Swans 

15. Tchaikowsky; Swan Lake . Waltz 

16. Gounod; Faust . Dance of the Nubians 

17. Ravel; Ma Mere L^ Oye , Pavanne 

18. Brahms; Hungarian Dance No . _6 



louder 

lower 

softer 

higher 

lower 

louder 

slower 

faster 

higher and louder 
softer and slower 
higher and louder 
softer and faster 
lower and slower 
higher and faster 
lower and louder 

lower 

softer 

faster 

higher 

louder 

lower 

softer 

slower 



I 
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Answer Sheet, Listening Measure 

Name 



Directions; 

We will hear some short musical eKamples. Each one will change in some 
way. Listen to the complete example and decide on the most important change. 
Read through the possible answers on your paper, but before marking your 
choice, listen again as the same example is repeated. After hearing the 
example the second time, fill in the square beside the answer you think is 



right. 


Be sure you fill in only one 


square for 


each 


example. 


SAMPLE , 

□ 


A 

faster 


3. 


a 


louder 


□ 


slower 




□ 


softer 


□ 


louder 




□ 


lower 


□ 


softer 




□ 


higher 


□ 


higher 




□ 


faster 


SAMPLE 

□ 


B 

louder 


4. 


□ 


louder 


□ 


higher 




□ 


softer 


□ 


lower 




□ 


faster 


□ 


faster 




□ 


slower 


□ 


8 lower 




□ 


lower 


1- □ 


faster 


5. 


□ 


softer 


□ 


slower 




□ 


higher 


□ 


louder 




□ 


lower 


□ 


softer 




□ 


faster 


□ 


lower 




□ 


slower 


2- □ 


faster 


6. 


□ 


faster 


□ 


slower 




□ 


slower 


□ 


louder 




□ 


softer 


□ 


higher 




□ 


higher 


□ 


lower 




□ 


lower 



152 



- 2 - 

In this section each answer has two parts. For eKaraple, the music may 
get higher and slower, or faster and louder. Choose the answer that has 
both parts right, but mark only one square for each example. 



SAMPLE 


c 










□ 


slower 


and 


softer 


□ 


higher and softer 


□ 


slower 


and 


louder 


□ 


higher and louder 


□ 


faster 


and 


softer 


□ 


higher and slower 


□ 


faster 


and 


louder 


□ 


lower and slower 


□ 


higher 


and 


louder 


□ 


lower and faster 



7. □ 


slower 


and 


softer 


11. □ 


lower and louder 


□ 


slower 


and 


louder 


□ 


lower and taster 


□ 


faster 


and 


softer 


□ 


higher and faster 


□ 


faster 


and 


louder 


□ 


higher and slower 


□ 


higher 


and 


louder 


□ 


louder and slower 


8. □ 


higher 


and 


softer 


12. □ 


lower and softer 


□ 


higher 


and 


louder 


□ 


lower and louder 


□ 


higher 


and 


slower 


□ 


higher and softer 


□ 


lower and ! 


Louder 


□ 


slower and softer 


□ 


lower and : 


Easter 




higher and louder 


9. □ 


slower 


and 


softer 






□ 


slower 


ana 


louder 






□ 


faster 


and 


softer 






□ 


faster 


and 


louder 






□ 


higher 


and 


louder 
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In this group each example has first! ’"Ma^^the 

and decide how the second part xs ™°st^d^ff one 

square beside the word which best g these are played, because they 

square for each example. Listen carexuii-y 

Mill not be repeated. 



SAMPLE D 




□ 


faster 


□ 


slower 


□ 


higher 


□ 


lower 


□ 


louder 



SAMPLE E 

□ louder 
I I softer 
n higher 
\ j slower 
\ I faster 

13. □ lower 

\ I louder 
□ softer 
j I faster 
j I slower 

14. □ lower 
[~1 higher 
j I slower 
] I faster 
I I softer 



□ louder 

□ slower 

□ faster 
n higher 

□ lower 

16. 

lower 
n higher 
i \ faster 
□ slower 
I I softer 

17. □ lower 

n higher 
I I faster 
j I louder 
j j softer 

18. □ lower 

I I faster 
I I slower 
j I louder 
I I softer 
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Directions for Administration, Listening Measure 



DIRECTIONS 



ON TAPE 



Distribute answer sheets. 

Make certain each child has a pencil 
with an eraser. 

Instruct children to write their 

names in the space provided on 
the first page. 

Start tape. 



"At the top of your answer sheet 
there are some directions. Read 
them silently to yourself while I 
read them to you; 

'We will hear some short 
musical examples. Each one 
will change in some way. Lis- 
ten to the complete example 
and decide on the most impor- 
tant change. Read through the 
possible answers on your paper, 
but before marking your choice, 
listen again as the same exam- 
ple is repeated. After hearing 
the example the second time, 
fill in the square beside the 
answer you think is right. Be 
sure you fill in only one 
square for each example.' 

Let's try a sample. Listen to Sam - 
ple A and hear how it changes." 



(Sample A) 



"Look at your answer sheet and de- 
cide which answer best describes 
how the music changed." 

(Pause) 

"Listen again before marking your 
answer to see whether you were 
correct ." 



Stop tape. 

Ask; "WHICH WORD DESCRIBES THE MOST 
IMPORTANT CHANGE?" 



(Sample A repeated) 

"Mark you answer for Sample A." 
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DIRECTIONS 



ON TAPE 



Allow children to answer (correct 
answer is 'louder'). 

Say; "FILL IN THE SQUARE BESIDE 

'LOUDER'. IF YOU HAD A DIFFER- 
ENT ANSWER, ERASE IT." 



After all have corrected Sample A, 
Say: "LET'S TRY ANOTHER SAMPLE." 
Start tape. 



"Listen 



to Sample B." 



(Sample B) 



"Decide what you think is the most 
important change. Look for your 
answer on the sheet, but listen 
again before marking your answer." 



(Sample B repeated) 



Stop tape. 

After children have marked answers. 
Ask; "WHAT DID YOU MARK?" 

After children have answered. 

Say: "LOWER IS RIGHT. ARE THERE ANY 

QUESTIONS ABOUT HOW TO DO THIS?" 
Answer any questions about procedure, 
then. 

Say; "LET'S BEGIN WITH NUMBER ONE." 
Start tape. 



"Mark your answer for Sample B." 



(Items 1 through 6) 

"Turn to the directions at the top 
of page 2." 



(Pause) 

"In this section each answer has 
two parts. For example, the music 
may get higher and slower, or faster 
and louder. Choose the answer that 
has both parts right, but mark only 
one square for each example." 

"Listen to a sample. Sample C." 



(Sample C) 
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DIRECTIONS 



ON TAPE 



- 3 



Stop tape. 

After children have marked answers, 

Ask; "WHICH ANSWER DID YOU MARK?" 

Allow children to answer (higher and 
louder) 

Say: "BE SURE YOU HAVE FILLED IN 

THE ONE SQUARE BESIDE THE TWO 
WORDS ' HIGHER AND LOUDER . ' 

MARK ONLY THE ONE SQUARE WITH 
BOTH PARTS CORRECT. ARE THERE 
ANY QUESTIONS?" 

After answering any questions, 

Start tape. 



Stop tape. 

Ask: "WHICH IS THE CORRECT 

ANSWER?" 

Allow children to answer (lower). 



"Decide the two most important ways 
the music changed and find the an- 
swer; but listen again before mark- 
ing the answer." 

(Sample C repeated) 

"Mark your answer for Sample C." 



(Items 7 through 12) 

"Turn to the directions at the top 
of page 3." 

(Pause) 

"In this group each example has two 
different parts. Listen to both 
parts and decide how the second part 
is most different from the first, 
Mark the square beside the word 
which best describes this difference. 
Mark only one square for each exam- 
ple. Listen carefully as these are 
played, because they will not be re- 
peated. Let's try a sample. Sam - 
ple D." 

(Sample D) 

"Mark the square beside the word 
which tells how the second part was 
different from the first part." 
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DIRECTIONS 



After children have corrected an- 
swers, 

Say: "LET'S TRY ANOTHER SAMPLE." 

Start tape. 



Stop tape. 

Give children time to mark an an- 
swer. 

Ask: "WHICH ANSWER DID YOU MARK?" 

Allow children to answer. 

Say: "SOFTER IS RIGHT." 

Allow children to correct answer. 

Ask: "ARE THERE ANY QUESTIONS?" 

After answering questions, 

Say: "BE SURE TO TELL HOW THE WHOLE 

SECOND PART IS DIFFERENT FROM 
THE WHOLE FIRST PART, NOT JUST 
HOW THE BEGINNING OF THE SEC- 
OND PART IS DIFFERENT FROM THE 
END OF THE FIRST PART. LET'S 
GO AHEAD WITH NUMBER 13. REMEM- 
BER, THERE ARE TWO DIFFERENT 
PARTS 'FOR EACH EXAMPLE, AND 
THESE ARE PLAYED ONLY ONCE." 

Start tape. 



Stop tape. 

Allow time for children to answer. 
Say: "FOLD THE PAGE BACK SO THAT 

THE SHEET WITH YOUR NAME IS ON 
TOP." 

Collect the papers. 



ON TAPE 



" Sample ^." 

(Sample E) 

"Mark your answer for Sample E." 



(Items 13 through 18) 
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Answer Sheet, Manipulative Measure 



Name 



Measuring 



D 



1 . 



D 



2 . 



P 



3. 



P 

L 



4. 



5. 



L 



6 . 



P 

P 

D 



7. 



8 . 



9. 



D 



10 . 



L 



11 . 



P 

P 

D 

D 

L 

L 



12 . 

13. 

14. 

15. 

16. 

17. 

18. 



Drum: 

Drum: 

Chord 

Same: 



Item 



Begin playing fast and 
slower. 



gradually get 



Right 



Wrong 



Begin playing slowly and gradually 
get faster. 



Organ: Using black buttons, find 

the lowest sounding chord. 



Find the highest sounding chord. 



Metal square, resonator bell, triangle, 
(all with rubber mallet) : Play in 

order from loudest to softest. 



Metal square, resonator bell, horseshoe 
(all with rubber mallet): Play in order 

from softest to loudest. 



Resonator bars: Play the highest sounding. 

Resonator bars: Play in order. 

Drum, triangle, resonator bell: Play 

the one which is medium in length. 

Metal square, finger cymbal, resonator 
bell: Play the one which sounds for the 

shortest time. 



Metal square, finger cymbal, resonator 
bell (with rubber mallet): Play in order 

from softest to loudest. 

Chord organ, metal square, resonator bell 
(wooden mallet): Play in order from 

loudest to softest. 



Ukulele; Play strings in order from 
lowest to highest. 

(4) Metal Plates; Play from highest to 
lowest. 

(3) Metal Plates; Play the one which has 
the shortest sound. 

(3) Copper pipes; Play the pipe which 
sounds for the longest time. 

Three notes on chord organ: Play in 

order from loudest to softest. 

Tape recorder: Show the softest, medium, 

and loudest positions. 
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Directions for Administration, Manipulative Measure 



The test administrator greets the child. The exact manner of greeting 
depends upon the appearance and actions of the child and cannot be specified. 
The administrator's manner should be friendly, aiming to relax the child and 
put him at ease. 

The test administrator will then say; I AM GOING TO ASK YOU TO PLAY 
THESE INSTRUMENTS. YOU MAY TRY THEM AS MANY TIMES AS YOU WISH. THEN, 
WHEN YOU HAVE DECIDED UPON YOUR ANSWER, SAY, "THIS IS MY ANSWER," AND 
PLAY YOUR ANSWER FOR ME. DO YOU HAVE ANY QUESTIONS? 



SUGGESTIONS; 

1. The order of placement of the instruments should be constant. The in- 
struments should be numbered so that they c^n be arranged in this order be- 
fore each child enters. 

2. The test administrator should avoid comments that might cue the child. 
When a child looks for approval, the test administrator should say, "Remem- 
ber, when you are ready, say, 'This is my answer,' and then play your an- 
swer." Administrator should avoid saying, after child has indicated his 
choice, "Is that your answer?" or "Are you sure?" He should also avoid 
reinforcing any answers. 

3. If test administrator has not heard the answer, or is not sure he has 
remembered it correctly, he should say, "Will you repeat that, please? I 
didn't hear it." 

4. Test administrator on the first few questions should remind pupil to say 
"This is my answer," and then play his answer. He may, if necessary, also 
remind the pupil that he may try the instruments as much as he wants before 
he arrives at an answer. 
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Classroom Orientation, Overt Measure 



After verbal and listening tests are complete, before leaving: 

ASK: Have you ever thought of the ways you could move to music? 

Wa are going to play some music. Think of some ways you could move. 

PLAY EXCERPT - "Vienna Maidens" by Ziehrer, on Listening tape, about 10 spaces 
after the end of the Listening items. 



After playing, ASK: 

How could you move to that music? 

Get as many responses as possible from class--use raised hands for calling on 
students to answer. (Possible responses--swinging, walking, turning, dancing, 
skipping, etc.) 



ASK: What other things could you do to different music? 



After responses are heard, SAY: 

Listen to this music again and decide whether the biggest change 
was: higher, lower, softer, faster, or slower, louder. 



PLAY EXCERPT AGAIN 

ASK: What was the biggest, most important change? (Answer should be higher ) 

SAY: Later we will be asking some of you boys and girls to show us the 

ways you would move to different kinds of music. Remember some of 
the ways we talked about this morning. 

Thank you very much for helping us with this project. We'll see 
some of you later. 
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Musical Sources of Items and Correct Responses 

Overt Measure 



Musical Source 



Correct Response 



1. Tchaikovsky: Marche Slav 

2. Von Suppe: Poet and Peasant . Overture 

3. Stravinsky: Petrouchka . Russian Dance 

4. Beethoven: Symphony No. 3, Third Movement 

5. Waldteufel: Intermezzo 

6. Lehar: Gold and Silver Waltz 

7. Liszt: Hungarian Rhapsody No . 2 

8. Mozart: Marriage of Figaro , Overture 

9. Bartok; Concerto for Orchestra , Second Movement 



softer 

faster 

slower 

louder 

slower 

higher 

lower 

louder 

higher 
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Answer Sheet, Overt Measure 



r 






Remarks 






















softer 


faster 


slower 


louder 


slower 


higher 


lower 


louder 


higher 


Incorrect 

verbal 

Response 




















Correct 

verbal 

Response 




















Change 
when music 
Changes 




















No 

Change 




















No 

Response 






















1 

rH 


• 

CM 


• 

CO 


• 

Mi- 


« 

in 


• 

VO 


• 


* 

CO 


• 

cn 
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Directions for Administration, Overt Measure 



"In your class we talked about the ways people move to music. Now 
we are going to play some music for you. Listen the first time and think how 
you could move to the music. It may make you think of running, skipping, 
walking, jumping, hopping, stretching, bending, stooping, reaching up, turn- 
ing, or something else. You can use your hands, feet, your head, or your 
whole body. 

"The second time the music plays, do what the music makes you think 
of doing. When the music changes, try to show us by your movement how the 
music changes." 

PLAY EXAMPLE THE FIRST TWO TIMES, INDICATING TO THE CHILD (IF NECESSARY) THAT 
HE SHOULD MOVE THE SECOND TIME. ENCOURAGE HIM TO MOVE IF HE SEEMS HESITANT. 
AFTER PLAYING THE SECOND TIME, STOP THE TAPE (ON NUMBER ONE ONLY) TO SAY; 

"Listen again and then tell the biggest change. Tell whether the 
biggest change is higher, lower, faster, slower, louder, or softer. You 
can look at this card to choose your answer." 

INDICATE CARD. START TAPE AGAIN. IF THE CHILD IS SLOW IN ANSWERING, IT MAY 
BE NECESSARY TO STOP THE TAPE BEFORE NUMBER TWO BEGINS. IF TAPE IS STOPPED, 
SAY; 

"There will be nine of these altogether. Listen, move, listen 
again, and answer. Go ahead with Number 2 now." 

START TAPE, STOPPING ONLY IF REQUIRED FOR THE ORAL ANSWER. 
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Stimulus Card, Overt Measure 



higher 

lower 

louder 

softer 

faster 

slower 
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OE MUSICAL CONCEPTS 
Project 2934 OE-6-10-002 



The Pennsylvania State University 
Department of Music Education 



GENERAL INFORMATION 

Time, Facilities . Personnel , and Related Information 



Description of Measuring Instrument 

The measuring instrument is divided into four submeasures: Listening, 

Verbal , Manipulative , and Overt. The first two submeasures are administered 
in combined form as a group measure in one classroom session; the remaining 
two submeasures are administered separately to individual pupils. 

Group Measure 



Time Required: 



Space Required: 
Equipment Required: 



Individual Measures 
Time Required: 

Space Required: 



Equipment & Facilities 
Required: 



Approximately one hour. Testing will be done within 
school hours. We need information on opening dates 
of school in the fall, any dates when testing will 
be inconvenient, and daily school hours. 

Classroom situation, children seated at regular 
desks, separated if possible. 

Pencil with eraser for each child. For the listen- 
ing measure an electrical outlet is needed for the 
tape recorder. A quiet environment will contribute 
to the validity of this measure. 



Total time 30 minutes per student, with two stu- 
dents out of class at any one time. 

Two rooms, one room large enough for the child to 
move around actively and freely, the other room 
smaller, perhaps office size. 



Both rooms must have electrical outlets and a table 
or desk cleared for use. Other necessary equipment 
will be furnished by the research team. 



Order of Administering Measures 

Group measure first; individual measures second. 

Test Scores and Other Information 

Data Needed: Most recent IQ and standardized reading scores, names of 

tests used, and approximate dates administered. 

Class roll of classes selected for measurement. 
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GENERAL INFORMATION (continued) 

Names of children with other than normal hearing who are 
involved in measurement sample. 

Selection of Students 

The research team will make a random selection from all available classes 
and inform you which class or classes from your district will be included in 
the sample. 



For your information, this measure has been administered to approximately 
sixty children in the pilot and pre-pilot programs. Parts of the measure have 
been given to approximately fifty additional children. There are indications 
that the testing program has been enjoyable for the participants. The mem- 
bers of the research team, all experienced in teaching children, make every 
effort to provide an atmosphere in which the children feel comfortable. 



Research Team: Dr. Frances M. Andrews, 

Dr. Ned C. Delhi, Grace E. Laver ty, 
Cathy J. Cobes, A. Peter Costanza 



OE MUSICAL CONCEPTS 
Project 2934 OE-6-10-002 

The Pennsylvania State University 
Department of Music Education 



QUESTIONNAIRE : Part I* 

1. Official name of school system 

2. Name of chief administrator 

Official title 

Address 

3. Name of Music Supervisor** 

Summer address 

4. Opening date of school 

5. Dates not desirable for visit 

6. School day hours A.M. to P.M. 

7. We will need IQ scores and standardized reading test scores. Do you prefer that 

we extract these scores when we visit the schools, or will it be more convenient 
for you to provide them? 

8. Please list by room number, by teacher* s name, or by some other means all avail- 
able 4th grade classes in buildings with adequate space for the measurement pro- 
cedure. It is important that homogeneous and heterogeneous groupings be listed 
in the appropriate columns. Please exclude Special Education classes. 

SEE EXAMPLE BELOW, 



EXAMPLE 



Building 


Location 


Heterogeneous 

Classes 


Homogeneous Classes: 
How Grouped? 


Approx. No. 
of Pupils 
in Class 


Comments 


Lincoln 


Washington 

Twp. 




Room 211; Miss Hale 
IQ's above 140 


30 


Alert , interested ; 
good situation 


i 

Lincoln 


Washington 

Twp. 


Room 212 




25 




Franklin 


Adams 

Borough 


Room 7 




40 


Uncooperative 

atmosphere 


Stevens 


Polk 

City 


Miss Smith 
Room 3 




20 


Pupils are predomi- 
nantly rural 


Stevens 


Polk 

City 


Mrs . Warner 
Room 4 




25 


Slum area; Title I 
school 


Stevens 


Polk 

City 




Room 213: Low 
achievement scores 
in reading 


30 





*To be completed by Music Supervisor or other designated person. 

**If a person other than Music Supervisor is school coordinator for this study, insert his 
name and address instead of Music Supervisor. 
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QUESTIONNAIRE: Part I (continued) 



Form for Describing Available Classes 



Building 


Location 


Heterogeneous 

Classes 


Homogeneous 
Classes: 
How Grouped? 


Approx . No . 
of Pupils 
in Class 


Comments 
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Part II 

The following information should be supplied by the chief school administra- 
tor or his administrative assistant. 

Please give this sheet to the proper person, asking that he complete and re- 
turn the form to us. 

Name of school system 

Location 

School enrollment: 

Secondary (7 through. 12) 

Elementary (1 through 6) 

Total 4th grade 

Predominant population characteristics of the school district (in percentages) 

Urban Surburban Rural 

ApproKimate per cent of parents in the following general categories: 

Professional - Business Semi-Professional (office workers) 

Skilled Labor (factory workers) Unskilled Labor 

Agriculture 

Assessed property value of the school district 

Political subdivisions of the school district, listed by name: 

Cities 

Towns 

Boroughs 

Townships 



Please list any Title I schools in the school district. 



Return to: 

Mrs. Diane Gold 
Music Education Department 
263 Chambers Building 
University Park, Pennsylvania 



Completed by: 
Official title 

Address 

Date 






Appendix E: Computer Programs Used in Statistical Analyses 



S3nnmetric Correlation Program (COREL) 

Item Analysis (ITANL) 

Complete Analysis of Variance Reliability (RELIB) , 



PAGE 
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ITANL 



ITEM ANALYSIS 11.0.014 

Walter Dick 

Richard E. Spencer The Pennsylvania State Univ. 

Office of Examination Services May 25, 1964 



General Description ; 

1. This program computes for each test item; the biserial correlation*, 
the point-biserial and the t-value associated with it, the difficulty 
level (p) , the mean total score of the students getting the item 
correct, and the mean total score of the students who miss the item. 

2. Summary information including the mean difficulty index, mean bi- 
serial correlation, an estimate of interitem correlation, the stan- 
dard error of the correlation, Kuder-Richardson #20 reliability, and 
test mean, variance and standard deviation are also computed. 



Output ; 

1. For each item; item number, correct answer, a table indicating the 
frequency with which each of the answer choices was selected by each 
subgroup in the test population, a total frequency with which each 
answer choice was selected; each of the statistics described in 
section 1 of General Description and, if desired, an evaluation of 
item difficulty, item discrimination, and a total item evaluation. 

2. For the total test; tables of frequency distributions of the test 
items with respect to item difficulty and item discrimination, plus 
the statistics described in section 2 of the General Description. 



Definitions ; 

In this description, the following definitions apply; 

Data deck ** a group of card representing the responses of all the 
testees on 50 or fewer items. 

Run * the processing of all the data decks which have been submitted. 
This would Include up to 17 data decks if the cards are hand punched. 
There is only one data deck per run if Digitek Cards are used. 



Capabilities and Limitations ; 

1. Only one test may be analyzed per run. 

2. Any number of students may be used. 

^Computational procedure from Computation Center program Sll.3.006, BISER. 
J. E. Singer 
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3. Total score or criterion score of any individual cannot exceed 999. 

4. If the tests have been scored by the Digitek Optical Scanner, the 
test cannot include more than 150 items. 

5. If the test data cards are hand punched, any number of items up to 
900 may be processed on one run (see section on Data Preparation). 

6. The population which has been tested may be subdivided into two, 
three, four or five groups in order to check the responses selected 
by various subgroups. For example, the testees could be subdivided 
into males and females, or in terms of their total score, i.e. lowest 
one-fifth, second fifth, middle fifth, fourth fifth and highest fifth. 
This subgrouping does not effect the item statistics and is not a 
procedural requirement of the program. (See section of Data Prepara- 
tion below.) Subgrouping is not required. 

7. No item can have more than one correct answer. If a test is being 
processed which does not necessarily have correct answers, arbitrary 
"correct answers" must be submitted. 

8. There can be no more than 5 possible response alternatives to any 
test items. (If an "omit"’"is counted as a response alternative, there 
are then 6 possible responses.) See section 1 of Data Preparation 
below. 

9. There can be no double punched card columns, except for Digitek pre- 
pared cards, and then only in card column 80. 



Preparation of Data Cards ; 

1. Hand punched cards. 

Hand punched data may be used in standard format, which is total score 
or criterion score in cols. 1-3 (right justified) and item responses (i.e. 1, 
2 3, 4, 5 or 0 for an omit) in cols. 4-53 with one response per column. 
Standard format is indicated with a "1" in col. 60 of the parameter card. Or, 
a format may be supplied by the user. This format must provide for reading 
the total score first and then up to 50 items, both of which are to be read 
in integer format, e.g. (7X, 13, 3512/lOX, 1512). Format at object time is 
indicated with a "2" in col. 60 of the parameter card. 

It is important to note that this program analyzes the data for up to 
50 items at one time. If the data is to be hand punched, and, there are more 
than 50 items in the test, separate data decks of cards must be made for each 
set of 50 items. Although the data is punched separately for each set of 50 
items in the test, the items will be sequentially numbered in the print-out 
and the summary data will be for the total number of test items. 
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11.0.014 

The program also requires that five cards with punched in cols. 1 

and 2 be used with each data deck of 50 items or less. (If format is being 
entered at object time, the (-l)'s must be punched in the same card cols, as 
the total score in punched.) These cards are used to indicate the end of 
data for various subgroups within the testae population. The fifth~l) card 
indicates the end of the data for one data deck of items. If the data cards 
for 40 students are sorted on total score from lowest to highest, five sub- 
poups could be formed by placing a (-1) card after the data cards for the 
8th., 16th.# 24th., 32nd., and 40th. students. If the population were to be 
dicotomized into males and females, a (-1) card would be inserted after the 
card for the last male student and ^ (-1) cards after the card for the last 
female. If no grouping is desired, 5 (-1) cards would be inserted after the 

data deck for up to 50 items. This process qiust be repeated for each data 
u 0clc • 



Approximately 18 data. decks representing 50 items each may be submitted 
for one run when the cards are hand punched and only one print-out is wanted. 
If more copies are needed, the number of items to be processed must be re- 
duced in order not to exceed maximum number of records. 

2. Digitek punched cards. 

If the item analysis cards have been punched by the Digitek Optical 
Scanner, the user is limited to analyzing 150 items per run. No matter how 
many items are being processed, there will always be 2 Digitek cards per 
student. If subgroups are to be used, these cards can be sorted appropriately 
with each student's two cards staying together. 

Five cards with (-1) punched in cols. 1 and 2 must be inserted into the 
Digitek data deck. These cards are used to indicate the end of subgroups 
within the testee population. See paragraph 3 of hand preparation of data 
cards for examples. The identical procedure is used for the Digitek cards 
with the exception that there will only be one data deck regardless of the 
n^ber of test items (up to 150). This is the exception to the definition 
of data deck which appears on page 1. 



Input Deck ! 

If hand punched cards are used; 

A. Parameter card 

B. Correct answer card 

C. Format card, if necessary 

D. Data cards, including 

5 (-1) cards 

E. Repeat A-B-C-D as necessary 

F. Blank card - end of run* 

*Number of copies of the test summary 
col. 65 of the end of run card. 



If Digitek cards are used; 



A. 


Parameter card 


B. 


Correct answer card 


C. 


All data cards, including 
5 (-1) cards 


D. 


Repeat A and B as necessary 


E. 


Blank card - end of run* 


which are 


needed should be punched in 
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Parameter Card: 




Cols. 1-4 


Test Code (Numeric) (Some Number must 
be punched). 


Cols. 5-14 


Course Name 


Cols. 15-24 


Instructor or identification 


Cols. 30-35 


Date (e.g. 121164 which will be printed 
12/11/64) 


Cols. 37-40 


Total number of students (right justified) 


Cols. 43-45 


No. of items in this data deck - right 
justified. (This is not the total 
number of items in the run.) 


Col. 50 


Item evaluation option 

a. If wanted, punch a 1 

b. If unnecessary, punch a 2 


Col. 55 


No. of answer choices to the test items 
(i.e. 2, 3,4, or 5) . 


Col. 60 


Punch 0 if Digitek cards are used. 



Punch 1 if data cards are hand punched 
in standard forraat, i.e. total score 
or criterion in cols. 1-3, item 
responses in cols. 4 - 53. 

Punch 2 if hand punched data cards are 
punched in other than standard format. 

Col. 65 Number of print-out copies needed. 

If 1, punch 1, etc. Up to 9 copies 
can be printed. 



Correct Answer Card ; 

Beginning with column 1, punch the correct answer for each item, one 
answer per column. Total number of items cannot exceed 50 for any one data 
deck. No identification is punched in this card. Each item can have only 
one correct answer. 

Answer choices A, B, C, D, E, and omit “ 1, 2, 3, 4, 5 and 0. 



Format Card ; 

If there is a 2 column 60 of the parameter card, the user must supply 
his own format card for reading the data. The format roust provide for read 
ing up to (but no more than) 50 items, and must be in fixed-point form. 
Example: (7X, 13, 3512/lOX, 1512) 
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Data Cards ; 

See the appropriate section under Data Preparation. 



MaKlmum Time Estimate ; 

For up to 300 students, approKlmately 5 seconds per Itemj 300 to 600 
students, 10 seconds per Item, etc. For 200 students and 60 Items, time - 
60 3C 5 « 300. 1£ Dlgltek cards are being used, add 100 seconds. 



Maximum Records Estimate; 



Approximately 25 records 
put copies, records - 25 x 60 


per Item per copy. 
X 2 « 3000. 


With sixty Items and 2 out- 


WARNING - Maximum records for one computer 


run Is 25,000. 


Formulas Used: 






1. X - gX 
N 






2 2 
2. fT - Nj;X^ - 






N (N 


- 1) 





3 • 0 / 2 

^ 1 

®l\or. - / N - 1 



5. 


p - 


N correct 






/ 


N 




6. 


’^bls 1 


X •* X 

■ cor incor 

* P.q 

n 


* ■ multiply 






y 




7. 


^p.bls 


■ ’^bls ^ V 








/ pq 




8. 


*^p-bls 


m ^bls * / N-2 

1 2 
1"^ . . 
p-bla 
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9. Reliability (K-R #20) = K 

c 



K-1 



■) ( 



a - E pq 

2 

a 



11 . 



^diff 



K 



<! 



10. Mean Interitem Correlation = ( ^^bis ) 

K 



j-i 

’ 3 - 



For interpretation of the output of this program, contact the Office of Exami 
nation Services. 
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COMPLETE A.O.V. RELIABILITY 
Internal Consistency 



RELIB 



Walter Dick March, 1965 

Office of Examination Services Pennsylvania State University 



Description ; 

The program will compute the internal consistency reliability of test 
data. The coefficient of reliability is determined through the use of the 
analysis of variance. It is exactly equivalent to the Kuder-Richardson 
Formula 20 estimate (Guilford, 1954, p. 383). The data may be of two types: 

(1) binary coded (1 and 0) to indicate correct and incorrect responses, or 

(2) continuous (1, 2, 3, 4, 5, ...) as on an attitude questionnaire. 

In addition, the program will compute the reliability of the data when 
item clusters are specified by the user. That is, if the items may be 
grouped in some meaningful way such as test items 1-5 refer to reading 
paragraph 1, items 6 - 10 to paragraph 2, etc., then a strata random and 
strata fixed reliability can be computed (Rabinowitz and Eikeland, 1964). 

If a parallel test were constructed, and the same types of paragraphs 
were selected, then the strata fixed estimate of reliability would be se- 
lected. If, however, paragraphs from different content areas or with differ- 
ent characteristics had been selected, then the strata random estimate would 
have been selected. 

The program also computes the internal consistency reliability of each 
of the strata which are identified by the user and indicates the product- 
moment correlation among the scores on each of the. strata. 

If there are strata within the data, the data must be submitted with the 
items in each strata punched consecutively. 

If the over-all reliability of the data only is desired, an "item per 
strata" card, containing the total number of items, still must be inserted. 

Limitations; 



a. Up to 500 total items may be used 

b. Up to 40 strata may be used 

c. Number of students unlimited 

Input Deck ; 

a. Object deck 

b. Parameter card 

c. Item per strata card 

d. Format card 

e. Data cards 

f. Repeat b through e 

g« End of run - blank card 
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Parameter Card: 



Col. 1-4 
Col. 5-8 
Col. 9 -12 
Col. 16 

Col. 21 -40 



No. of strata or item clusters (right justified) 
Total no. of test items (right justified) 

No. of subjects (right justified) 

= 1 for total reliability only 
= 0 for stratified reliability estimates also 
Problem identification 



Item per Strata Card : 

Indicate the number of items in each strata; 



Col. 1-2 
Col. 3-4 
Col. 5-6 
etc . 



No. of items in strata 1 (right justified) 
No. of items in strata 2 (right justified) 
No. of items in strata 3 (right justified) 



Format Card : (required) 

A format for reading the item responses in floating point mode must be 
submitted. If 50 items were submitted, punched in the first 50 cols., the 
format would be (50F1.0). 

Data Cards: 



Data cards must be punched to exactly conform to she format which is 
submitted. 

Maximum Time: 



One third second per card 
Maximum Records Estimate : 

50 for each test being analyzed. 

References : 

Guilford, J. P. Psychometric methods . New York: McGraw Hill, 1954. 

Rabinowitz, W. and Eikeland, H. M. Establishing the reliability of 
tests with clustered items. Pedagogisk Forskning , 1964, p. 85-106. 
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THE PENNSYLVANIA STATE UNIVERSITY 
Computation Center 



SYMETRIC CORRELATION PROGRAM 
FLOATING INPUT 



COREL 

11.3.003 (Revised) 



M. E. Roberts 

A. T. Wink January 1964 



GENERAL DESCRIPTION 

This program computes correlation coefficients of all possible pairs of 
variables plus the means and standard deviations of all variables. 

OUTPUT 

Correlations are printed in a triangular matrix followed by tables of 
means and standard deviations. All values are rounded to 3 decimals. 

If multiple regression or factor analysis is to be computed, sets of 
cards containing means, standard deviations and correlations are punched for 
variable 1 through the variable stated on the parameter card. Means and 
standard deviations are rounded to 3 decimals and correlations to 5. 

CAPABILITIES AND LIMITATIONS 

Number of variables may not exceed 105 
Data, read with Standard Format, is 4-digit 

Data size and/or decimal designation may be changed by inserting a 
format at object time 

Decimals punched in data take precedence over format specification 
Number of cases is unlimited, but if the number of cases is very large, 
accuracy will decrease as indicated under method. 

If signs are punched they precede the data 
Positive signs usually are not punched 
Negative signs must be punched 

Case numbers must be positive, non-zero, and generally distinct 
Successive case numbers may never be identical 

Cards per case are numbered sequentially starting with 1. Sequence 
within each case is checked. 

If the total no. of variables can be punched on one card, case numbers 
and sequence numbers are not used. (See PARAMETER CARD - Col. 23) 

METHOD 

All computations are done in single precision floating point arithmetic. 
This provides 8-digit values for computing. Data of larger magnitude (4- or 
5-digit data) or problems with large numbers of cases (over 1000) will develop 
output of lowered acc^vc>‘y. 
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SYMETPaC CORRELATION PROGRAM 



COREL 



11.3.003 (Revised) 



INPUT 

A. PARAMETER CARD 

B. FORMAT CARD (if format choice is "2”) 

C. DATA CARD 

D. TRANSFER CARD (not used at end of final data set) 

Repeat A, B, C, D as desired 

E. END-OF-RUN CARD - Used only after end of F. D. S. 

PARAMETER CARD 



Right justify ail values except problem name. 
Unused columns may be unpunched or zeroes. 



Col. 
Col. 
Col . 
Col. 



1-9 

12-14 

16-18 

21 



Col. 23 



Col. 26-29 



Col. 31-38 



PARAMETER 
Number of variables (,< 105) 

Number of variables per card 
Format choices are 1, 2, or 3. 

1. Standard format applies: there are 2 possible 

forms depending upon whether Col. 23 of para- 
meter card is 0 or 1. (See Col. 23) 
la. If Col. 23 of parameter card is 0; Standard 
format is (214, 18F4.0) 

Data cards are then: 

Col. 1-4 Case number 

Col. 5-8 Card sequence number 

Col. 9-80 Eighteen 4-digit fields for 

variables 

Jb. If Col. 23 of parameter card is 1; Standard 
format is (20F4.0) 

Data cards are then: 

Col. 1-80 Twenty 4-digit fields for 

variables 

Format card, inserted with this problem, applies. 
Format card, most recently inserted, applies. 
Before choice 3 may be used the user must have 
inserted his own format card under choice 2 in 
the previous problem. 

0 or 1 

a. 0 if case and sequence numbers are necessary, 

"" Must be 0 is TN0V>No. of variables per card. 

h. Must be 1 if total No. of variables number 

of variables per card. 

Number of last variable to be used for Factor Analysis 
or Multiple Regression. Means, standard deviations and 
correlations will be punched for all variables from 1 
through number punched here. If no cards are desired, 
leave these columns blank. Must be 64 or less for 
MREG; 105 or less for FAN , 

Problem name 



2 . 

3. 
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SYMETRIC CORRELATION PROGRAM 



COREL 

11.3.003 (Revised) 



FORMAT CARD 




FORMAT 



NOTE: Col. 1-6 



(la, Ib, cFw.d) if 0 in Col. 23 of parameter card 



must have the word FORMAT. 



a = width of field for case numbers 
b = width of field for card sequence numbers 

c = number of variables per card, (parameter card Cols. 17-18) 
w = width of field for each variable 
d = number of places after decimal to be read. 



^ FORMAT (cFw.d) 



if 1 in Col. 23 of parameter card, c, 
w, d, are interpreted as above. 



DATA CARDS 

if 0 in Col. 23 of parameter card 



A “ case number - Must be positive 
B = Card sequence number 
V = Variables 

All values right justified 






A B Vj Vj 




if 1 in Col. 23 of parameter card 



TRANSFER CARD 

Col. 1-8 TRANSFER 

END-OF-RUN CARD 

Col. 1-3 END 



EQUATIONS 

X, 







N 

(N-1) 
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SY^IETRIC CORRELATION PROGRAM 



COREL 

11.3,003 (Revised) 



DESCRIPTION OF CARD OUTPUT 



If cards are requested, a single card deck is returned for each batch 
of problems regardless of the number of problems requiring cards. 

This deck can be separated into problem decks by interpreting the first 
60 columns (use standard panel) and reading Cols. 1-8 which contain problem 
name. 



The 

Layout 


order of 
of Mean 


cards must not be changed, 
and Standard Deviation Cards 


Cols. 


1-8 


Problem name 


Cols . 


9-17 


Type 






Means 






Standard deviations 


Cols . 


18-20 


Card sequence within type 


Cols. 


21-80 


Six 10-digit fields for means and sta: 


Layout 


of Correlation Cards 


Cols . 


1-8 


Problem name 


Cols. 


9-13 


Type - COR 


Cols. 


14-16 


Card sequence within type 


Cols. 


17-80 


Eight 8-digit fields for correlations 



MAXIMUM TIME 



300 seconds plus 20 seconds for each 100 data cards 
MAXIMUM OUTPUT RECORDS 
500 

NOTE ; The differences between this program and the original COR-F are that: 
(a) card sequence and case numbers are not necessary if the no. of 
variables » no. of variables per card. This option is explained under 
Col. 23 of the parameter card; (b) Standard deviations are computed 
using n-1 degrees of freedom rather than n degrees of freedom; and 
(c) control cards are mnemonic in designation. 
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(TOP) 



001 

100 

101 

102 

103 

200 

300 

310 

320 

330 

340 

350 

400 

500 

501 

600 

601 

602 

603 

604 

605 

606 

607 

800 

801 

802 

803 

804 

805 

806 

807 

808 

809 

810 
811 
812 

813 

814 

815 

816 

817 

818 

819 

820 
821 
822 
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NETHIEVAU TERMS 

Musical concepts 
Pitch 
Duration 
Loudness 

Musical elements 
Musical Concept Measures 
Music Tests 



Elementary music tests 
Fourth-grade music 



IGENTIML^S 

Battery of Musical 



A multimodal technique (The Battery of Musical Concept Measures) was developed for 
identifying elenentary school children's concepts of pitch, duration, and loudness. 
The Battery included four measures: Verbal, written responses to written stimuli; 

Listening, written responses to multidimensional musical stimuli; Manipulative, 
oral responses to subject-manipulated stimulus materials; 

and oral responses to musical stimuli. Reliabilities were: Verbal, Listen- 

ing, .85; Manipulative, .66; Overt, .64. The group measures, Verbal and Listening, 
were administered to 429 fourth-grade children from sixteen r.indomly-selected 
classrooms in twelve representative Pennsylvania school districts; the individual 
measures, Manipulative and Overt, were administered to 214 subjects randomly se- 
lected from the total sample. Written scores for pitch, duration, and loudness 
were derived from combining the group measures; nonwritten f 

the individual measures. Reliabilities were: Written Pitch, .71, Written Dura 

tion, .69; Written Loudness, .60; Nonwritten Pitch, .65; Nonwritten Duration, .55, 
Nonwiitten Loudness, .34. Content validity was established by consensus of ex- 
ports; construct validity was investigated in relation to a multitrai t-mul timethoc 

Ltrix. Correlations and partial correlations with IQ Ztllnis 

symmetric correlation matrix of derived scores are reported. A secondary analysis 
ofthe Overt Measure showed no significant relationship between overt and oral 
responses, and indicated seme confusion by subjects in labeling musical changes. 
Although further refinement is needed before the Battery is acceptable for practi- 
cal use, the present measures are adequate for research purposes. 
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